Multi-Agent

Catalog entries using this tag (links open the entry card on its page):

Entries

AutoResearchClaw

AI Auto Research Scientific Discovery Multi-Agent Human-in-the-Loop Open Source

FULL NAME

AutoResearchClaw — Self-Reinforcing Autonomous Research with Human-AI Collaboration

DESCRIPTION

AutoResearchClaw is an open-source 23-stage autonomous research pipeline from UNC Chapel Hill that turns a research idea into a conference-ready LaTeX paper. Features: multi-agent debate for hypothesis generation, self-healing executor with Pivot/Refine decision loop, verifiable result reporting preventing hallucinations, human-in-the-loop with 7 intervention modes, and cross-run evolution. Outperforms AI Scientist v2 by 54.7% on ARC-Bench. MIT licensed.

Show full descriptionShow less

URL

https://github.com/aiming-lab/AutoResearchClaw

Main citation

AIMING Lab. (2026) AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration. arXiv:2605.20025. doi:10.48550/arXiv.2605.20025

ABSTRACT

Automating scientific discovery requires more than generating papers from ideas. Real research is iterative: hypotheses are challenged from multiple perspectives, experiments fail and inform the next attempt, and lessons accumulate across cycles. We present AutoResearchClaw, a multi-agent autonomous research pipeline built on five mechanisms: structured multi-agent debate for hypothesis generation and result analysis, a self-healing executor with a Pivot/Refine decision loop that transforms failures into information, verifiable result reporting that prevents fabricated numbers and hallucinated citations, human-in-the-loop collaboration with seven intervention modes, and cross-run evolution that converts past mistakes into future safeguards. On ARC-Bench, AutoResearchClaw outperforms AI Scientist v2 by 54.7%.

Show full abstractShow less

DOI

10.48550/arXiv.2605.20025

BioMedAgent

AI Agent Biomedical Multi-Agent Bioinformatics

PUBMED_LINK

41912700

FULL NAME

BioMedAgent: self-evolving multi-agent LLM framework for biomedical data analysis

DESCRIPTION

BioMedAgent is a self-evolving LLM multi-agent framework that learns to use diverse bioinformatics tools and chain them into executable workflows through interactive exploration and memory retrieval algorithms. It allows biomedical users to initiate tasks using natural language, without requiring computational expertise. Evaluated on BioMed-AQA benchmark (327 biomedical data tasks), BioMedAgent achieved a 77% success rate, surpassing other LLM agents, and generalized robustly to the external BixBench dataset. Beyond benchmarks, it autonomously performs cross-omics analysis, machine-learning modelling and pathology image segmentation.

Show full descriptionShow less

URL

https://www.nature.com/articles/s41551-026-01634-6

TITLE

Empowering AI data scientists using a multi-agent LLM framework with self-evolving capabilities for autonomous, tool-aware biomedical data analyses.

Main citation

Bu D, Sun J, Li K, He Z, Huang W, Hu J, Zhang S, Lei S, Huo P, Wang Z, Wang S, Wang T, Gao K, Wu Y, Zhao L, Wang K, Li G, Song H, Jin Y, Zhang K, Chen R, Zhao Y. (2026) Empowering AI data scientists using a multi-agent LLM framework with self-evolving capabilities for autonomous, tool-aware biomedical data analyses. Nature Biomedical Engineering. doi:10.1038/s41551-026-01634-6. PMID 41912700

ABSTRACT

Artificial intelligence agents are emerging as powerful applications of large language models (LLMs), automating complex tasks and enabling scientific data exploration. However, their use in biomedical data analysis remains limited by the difficulty of handling specialized tools and multistep reasoning. Here we introduce BioMedAgent, a self-evolving LLM multi-agent framework, which learns to use diverse bioinformatics tools and chain them into executable workflows through interactive exploration and memory retrieval algorithms. It allows biomedical users to initiate tasks using natural language, without requiring computational expertise. Evaluated on our newly released BioMed-AQA benchmark comprising 327 biomedical data tasks, BioMedAgent achieved a 77% success rate, surpassing other LLM agents, and generalized robustly to the external BixBench dataset. Beyond benchmarks, it autonomously performs cross-omics analysis, machine-learning modelling and pathology image segmentation, highlighting its potential to advance biomedical research and extend to other scientific domains requiring complex tool integration and multistep reasoning.

Show full abstractShow less

DOI

10.1038/s41551-026-01634-6

Co-Scientist

AI Agent Scientific Discovery Multi-Agent Hypothesis Generation Gemini

PUBMED_LINK

42156544

FULL NAME

Co-Scientist - A Multi-Agent AI System for Accelerating Scientific Discovery

DESCRIPTION

Co-Scientist is a multi-agent AI system built on Gemini for structured scientific thinking and hypothesis generation. It aims to help scientists discover new original knowledge by formulating demonstrably novel research hypotheses for experimental validation, conditioned on research objectives and prior scientific evidence.

Show full descriptionShow less

TITLE

Accelerating scientific discovery with Co-Scientist.

Main citation

Gottweis J, Weng WH, Daryin A, Tu T, Sirkovic P, Myaskovsky A, Glowaty G, Weissenberger F, Orlandi A, Popovici D, Palepu A, Rong K, Tanno R, Saab K, Zhang F, Blum J, Carroll A, Kulkarni K, Tomašev N, Zverinski D, Rendulic I, Vedadi E, Hasler F, Rimanic L, Boia M, Budiselic I, Feinstein B, Bellaiche M, Sheffer T, Freyberg J, Ratcliff J, Bertolli O, Chou K, Hassidim A, Gokturk B, Vahdat A, Guan Y, Dhillon V, Vaishnav ED, Lee B, Costa TRD, Penadés JR, Peltz G, Matias Y, Manyika J, Hassabis D, Xu Y, Kohli P, Pawlosky A, Karthikesalingam A, Natarajan V. (2026) Accelerating scientific discovery with Co-Scientist. Nature. doi:10.1038/s41586-026-10644-y. PMID 42156544

ABSTRACT

Scientific discovery is driven by scientists generating novel hypotheses for complex problems that undergo rigorous experimental validation. To augment this process, we introduce Co-Scientist, a multi-agent AI system built on Gemini for structured scientific thinking and hypothesis generation. Co-Scientist aims to help scientists discover new original knowledge. Conditioned on their research objectives and prior scientific evidence, it formulates demonstrably novel research hypotheses for experimental validation.

Show full abstractShow less

DOI

10.1038/s41586-026-10644-y

DeepRare

AI Agent Rare Disease Diagnosis Multi-Agent Clinical AI

PUBMED_LINK

41708847

FULL NAME

DeepRare - A Multi-Agent System for Rare Disease Diagnosis with Traceable Reasoning

DESCRIPTION

DeepRare is a multi-agent system for rare disease differential diagnosis that integrates large language models with structured medical knowledge to provide traceable reasoning. It aims to reduce the diagnostic odyssey for rare disease patients by leveraging AI agents to analyze clinical phenotypes, genomic data, and medical literature in a transparent, interpretable manner.

Show full descriptionShow less

TITLE

An agentic system for rare disease diagnosis with traceable reasoning.

ABSTRACT

Rare diseases affect more than 300 million people worldwide, yet timely and accurate diagnosis remains an urgent challenge. Patients often endure a prolonged 'diagnostic odyssey' exceeding 5 years, marked by repeated referrals, misdiagnoses and unnecessary interventions, leading to delayed treatment and substantial emotional and economic burden. Here we present DeepRare, a multi-agent system for rare disease differential diagnosis decision support with traceable reasoning.

Show full abstractShow less

DOI

10.1038/s41586-025-10097-9

Denario

AI Auto Research Scientific Discovery Multi-Agent arXiv Flatiron Institute

FULL NAME

Denario — Deep Knowledge AI Agents for Scientific Discovery

DESCRIPTION

Denario is an AI multi-agent system from the Flatiron Institute (Simons Foundation) designed to serve as a scientific research assistant across disciplines. It can generate ideas, check literature for novelty, develop research plans, write and execute code, make plots, and draft and review scientific papers. Demonstrated across 11 AI-generated paper drafts in astrophysics, biology, biophysics, chemistry, material science, medicine, neuroscience and more. Excels at combining ideas across disciplines (e.g., quantum physics + ML applied to astrophysics).

Show full descriptionShow less

URL

https://github.com/AstroPilot-AI/Denario

Main citation

Villaescusa-Navarro F, Bolliet B, Villanueva-Domingo P, et al. (2025) The Denario project: Deep knowledge AI agents for scientific discovery. arXiv:2510.26887. doi:10.48550/arXiv.2510.26887

ABSTRACT

We present Denario, an AI multi-agent system designed to serve as a scientific research assistant. Denario can perform many different tasks, such as generating ideas, checking the literature, developing research plans, writing and executing code, making plots, and drafting and reviewing a scientific paper. In this work, we describe in detail Denario and its modules, and illustrate its capabilities by presenting multiple AI-generated papers generated by it in many different scientific disciplines such as astrophysics, biology, biophysics, biomedical informatics, chemistry, material science, mathematical physics, medicine, neuroscience and planetary science. Denario also excels at combining ideas from different disciplines.

Show full abstractShow less

DOI

10.48550/arXiv.2510.26887

Robin

AI Agent Scientific Discovery Multi-Agent Automation Biology

PUBMED_LINK

42156546

FULL NAME

Robin - A Multi-Agent System for Automating Scientific Discovery

DESCRIPTION

Robin is the first multi-agent system capable of fully automating both hypothesis generation and data analysis for experimental biology. By integrating literature search agents with data analysis agents, Robin can generate testable hypotheses from literature and design experiments to validate them, automating the entire scientific discovery cycle for biological research.

Show full descriptionShow less

TITLE

A multi-agent system for automating scientific discovery.

Main citation

Ghareeb AE, Chang B, Mitchener L, Yiu A, Szostkiewicz CJ, Shved D, Gyimesi GJ, Laurent JM, Wright SM, Razzak MT, White AD, Finnemann SC, Hinks MM, Rodriques SG. (2026) A multi-agent system for automating scientific discovery. Nature. doi:10.1038/s41586-026-10652-y. PMID 42156546

ABSTRACT

Scientific discovery is driven by the iterative process of observation, hypothesis generation, experimentation, and data analysis. Despite recent advancements in applying artificial intelligence to biology, no system has yet automated all these stages. Here, we introduce Robin, the first multi-agent system capable of fully automating both hypothesis generation and data analysis for experimental biology. By integrating literature search agents with data analysis agents, Robin can generate testable hypotheses from literature and design experiments to validate them.

Show full abstractShow less

DOI

10.1038/s41586-026-10652-y

SciToolAgent

AI Agent Knowledge Graph Scientific Computing Tool Integration Multi-Agent

PUBMED_LINK

40835791

FULL NAME

SciToolAgent - A Knowledge-Graph-Driven Scientific Agent for Multitool Integration

DESCRIPTION

SciToolAgent is a knowledge-graph-driven scientific agent that integrates multiple computational tools for scientific research. It leverages a knowledge graph to understand tool capabilities and relationships, enabling automated multi-tool workflow composition for complex scientific tasks across domains including bioinformatics, cheminformatics, and materials science.

Show full descriptionShow less

TITLE

SciToolAgent: a knowledge-graph-driven scientific agent for multitool integration.

Main citation

Ding K, Yu J, Huang J, Yang Y, Zhang Q, Chen H. (2025) SciToolAgent: a knowledge-graph-driven scientific agent for multitool integration. Nature Computational Science, 5(10):962-972. doi:10.1038/s43588-025-00849-y. PMID 40835791

ABSTRACT

Scientific research increasingly relies on specialized computational tools, yet effectively utilizing these tools requires substantial domain expertise. While large language models show promise in tool use, they struggle with complex multi-tool workflows. Here we introduce SciToolAgent, a knowledge-graph-driven scientific agent that integrates multiple computational tools for scientific research. By leveraging a knowledge graph to understand tool capabilities and relationships, SciToolAgent enables automated multi-tool workflow composition for complex scientific tasks.

Show full abstractShow less

DOI

10.1038/s43588-025-00849-y

SPARK

AI Agent Pathology Cancer Multi-Agent Biomedical

PUBMED_LINK

42056496

FULL NAME

SPARK (System of Pathology Agents for Research and Knowledge)

DESCRIPTION

SPARK (System of Pathology Agents for Research and Knowledge) is a foundational agentic AI framework that uses language as a universal interface to autonomously generate biologically driven concepts for tumor analysis. It functions as a pathology 'brain' — an interconnected system of AI agents that autonomously reason, generate and implement biologically meaningful hypotheses as analytical tools without additional model training. SPARK uses four linked modules: idea generation (OpenAI o1), idea refinement, parameter coding (Claude Sonnet 3.5), and parameter verification. Evaluated across 18 patient cohorts spanning 5 cancer types and >5,400 patients, SPARK produced clinically and biologically relevant concepts correlated with prognosis, pathological variables, and predictive biomarkers, including patterns of tumor progression inferred from static images.

Show full descriptionShow less

URL

https://github.com/cpath-ukk/SPARK

TITLE

An agentic framework for autonomous scientific discovery in cancer pathology.

Main citation

Trost F, Zhang B, Aring J, Glamann L, Wessolly M, Johnson K, Göbel H, Lerbs T, Sangenne T, Herrmann P, Mairinger F, Kopp C, Michels S, Rasokat A, Heldwein M, Wagner S, Schömig-Markiefka B, Wolf J, Hartmann S, Wickenhauser C, Bychkov A, Klussmann JP, Quaas A, Buettner R, Tolkach Y. (2026) An agentic framework for autonomous scientific discovery in cancer pathology. Nature Medicine. doi:10.1038/s41591-026-04357-y. PMID 42056496

ABSTRACT

Artificial intelligence has advanced cancer pathology, but many systems still depend on hand-crafted features, are hard to explain and rely on fragmented workflows. We introduce SPARK (System of Pathology Agents for Research and Knowledge), a foundational agentic artificial intelligence approach that uses language as a universal interface to autonomously generate biologically driven concepts for tumor analysis. SPARK turns biological ideas into analytical tools and works directly with complex pathology data without extra model training. We evaluated SPARK across 18 patient cohorts spanning five cancer types (lung adenocarcinoma, lung squamous cell carcinoma, colorectal cancer, breast cancer and oropharyngeal squamous cell carcinoma) and more than 5,400 patients with available histopathology images and clinical/follow-up information, in both prognostic and predictive settings and on a well characterized spatial biology breast cancer dataset (n=625). We found that SPARK produced clinically and biologically relevant concepts correlated with prognosis, known pathological variables and predictive biomarkers, including patterns of tumor progression and temporal change inferred from static images. A dedicated module allows for human interaction with SPARK. All code, parameters and results are openly released.

Show full abstractShow less

DOI

10.1038/s41591-026-04357-y

Virtual Lab

AI Agent Virtual Lab Nanobody SARS-CoV-2 Drug Discovery Multi-Agent

PUBMED_LINK

40730228

FULL NAME

Virtual Lab - AI Agent Teams for Scientific Discovery

DESCRIPTION

The Virtual Lab is an AI agent framework that uses LLM-powered researchers in a simulated laboratory environment to collaboratively design and test scientific hypotheses. It was demonstrated by successfully designing new SARS-CoV-2 nanobodies, with AI agents specializing in different scientific roles working together to propose, evaluate, and refine experimental designs.

Show full descriptionShow less

URL

https://github.com/kyle-swanson/virtual-lab

TITLE

The Virtual Lab of AI agents designs new SARS-CoV-2 nanobodies.

ABSTRACT

Science frequently benefits from teams of interdisciplinary researchers, but many scientists do not have easy access to experts from multiple fields. Although large language models (LLMs) have shown an impressive ability to aid researchers across diverse domains, their uses have been largely limited to answering specific scientific questions rather than performing open-ended research. Here we expand the capabilities of LLMs for science by introducing the Virtual Lab, a framework where LLM-powered AI agents collaborate in a simulated laboratory to design and test scientific hypotheses. The Virtual Lab successfully designed new SARS-CoV-2 nanobodies, demonstrating the potential of multi-agent AI systems for open-ended scientific discovery.

Show full abstractShow less

DOI

10.1038/s41586-025-09442-9