
Multiagent Finetuning: A Conversation with Researcher Yilun Du

Sarah Welsh

Contributor

This week we were excited to talk to Google DeepMind Senior Research Scientist (and incoming Assistant Professor at Harvard), Yilun Du, about his latest paper “Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains.” This research introduces a multiagent finetuning framework that enhances the performance and diversity of language models by employing a society of agents with distinct roles, improving feedback mechanisms and overall output quality.

The method enables autonomous self-improvement through iterative finetuning, achieving significant performance gains across various reasoning tasks. It is versatile, applicable to both open-source and proprietary LLMs, and can integrate with human-feedback-based methods like RLHF or DPO, paving the way for future advancements in language model development. Read on for a full overview of the discussion and a summary of the key findings.


Summary and Overview

In this conversation, we explored a novel approach to overcoming the limitations of traditional LLM fine-tuning. Instead of a single model iteratively training on its own generated data, this method leverages a team of specialized models that learn from each other, maintaining diversity in reasoning and sustaining long-term performance gains.

Here’s an overview of what we discussed, and some key elements of the research itself.

Key Concepts in Multi-Agent Fine-Tuning

SallyAnn DeLucia and Yilun Du

Traditional fine-tuning methods often stagnate after a few iterations, limiting their effectiveness for long-term learning. Multi-agent fine-tuning addresses this by introducing a collaborative learning approach where multiple models interact, specialize, and iteratively improve together. This structured cooperation enables models to refine their reasoning strategies while preserving diversity. Below are the fundamental principles that drive this process, followed by a short code sketch of the loop after the list:

  1. Multiple Agents: The process begins with a single base language model, which is copied to create multiple agents that take on distinct roles.
  2. Specialization of Roles:
    1. Generation Agents: Some copies of the model function as “generation agents,” responsible for producing initial responses to problems.
    2. Critic Agents: Other copies serve as “critic agents,” evaluating and refining the responses generated by their peers.
  3. Multi-Agent Debate: The generation agents produce responses, which are then assessed by the critic agents through an iterative debate process.
    Communication between agents occurs in natural language (English), though alternative formats like JSON or latent representations could also be explored.
  4. Consensus and Data Selection: The best responses are determined by majority vote. Responses that align with the consensus are used as training data for their respective generating agents.
  5. Specialized Training: Each agent is fine-tuned only on the subset of data it generated that matched the consensus. This prevents convergence to a single reasoning pattern and fosters diverse problem-solving strategies.
  6. Iterative Improvement: The cycle repeats across multiple rounds, leading to continuous specialization and improvement.
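
To make the flow of these steps concrete, here is a minimal Python sketch of one round of the cycle. It is an illustration under assumptions rather than the paper's implementation: `generate`, `critique`, and `finetune` are hypothetical callables wrapping whatever LLM inference and finetuning stack is used, and responses are treated as plain strings.

```python
from collections import Counter

def multiagent_round(generators, critics, problems,
                     generate, critique, finetune, debate_steps=2):
    """One simplified round of the multiagent finetuning cycle described above.

    `generators` and `critics` are copies of a single base model;
    `generate`, `critique`, and `finetune` are hypothetical wrappers
    around the underlying LLM inference and training APIs.
    """
    gen_data = [[] for _ in generators]
    for problem in problems:
        # Step 3a: generation agents produce initial responses.
        initial = [generate(agent, problem) for agent in generators]
        # Step 3b: critic agents iteratively evaluate and refine the responses.
        refined = initial
        for _ in range(debate_steps):
            refined = [critique(agent, problem, refined) for agent in critics]
        # Step 4: the consensus answer is chosen by majority vote.
        consensus = Counter(refined).most_common(1)[0][0]
        # Step 5: each generation agent keeps only its own responses that agree
        # with the consensus (in practice one would compare extracted final
        # answers rather than full response strings).
        for i, answer in enumerate(initial):
            if answer == consensus:
                gen_data[i].append((problem, answer))
    # Step 6: each agent is finetuned only on the data it generated itself,
    # preserving diverse reasoning strategies across agents.
    # Critic agents would be finetuned analogously on their own refined responses.
    return [finetune(agent, data) for agent, data in zip(generators, gen_data)]
```

Passing the model calls in as callables keeps the sketch independent of any particular model API; the essential structural point is that each agent's training set is restricted to its own consensus-matching generations.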

Key Findings and Advantages

From the paper: Graphic showing how the multiagent finetuning procedure enables models to improve across multiple iterations of finetuning. Results reported on the MATH dataset.

The multi-agent fine-tuning approach introduces a paradigm shift in model training, allowing for sustained performance improvements that were previously unattainable with single-agent methods. By maintaining multiple agents with distinct roles and promoting diverse reasoning strategies, the system is able to consistently refine its outputs without falling into the common pitfalls of homogenization and stagnation.

Here are some of the most notable advantages of this approach…

  • Sustained Performance Gains: Unlike traditional fine-tuning methods, which plateau after a few iterations, this multi-agent approach maintains steady improvements over successive training rounds.
  • Enhanced Reasoning: The method leads to significant performance improvements on complex reasoning tasks.
  • Generalization to New Datasets: Fine-tuned models can transfer their learned reasoning strategies to novel datasets without requiring direct training on them.
  • Maintained Response Diversity: By ensuring each agent is trained only on its own correct responses, the approach preserves a variety of reasoning paths, mitigating the collapse into uniform outputs that often plagues single-agent fine-tuning.
  • Broad Applicability: The technique has been successfully tested on both open-source models (e.g., Phi-3, Mistral, LLaMA-3) and proprietary models (e.g., GPT-3.5), demonstrating its versatility.

Challenges and Other Considerations

While promising, multi-agent fine-tuning also presents several challenges:

  • Diversity Maintenance: Ensuring that agents don’t converge to similar reasoning patterns requires careful data selection strategies.
  • Coordination vs. Independence: There’s a trade-off between improving individual agent performance and optimizing the overall system’s cooperative effectiveness.
  • Computational Cost: Training and running multiple agents require significantly more resources than single-agent fine-tuning, though it remains more efficient than explicit reasoning models.
  • Real-World Applicability: While the method has shown strong results in reasoning tasks, further exploration is needed to assess its effectiveness in broader, real-world applications.

Potential Use Cases and Future Directions

Du highlighted several exciting directions for this research:

  • Expanding Model Diversity: Future iterations may start with different base models for each agent to further enhance diversity.
  • Single-Model Adaptation: The methodology could potentially be adapted for single-model settings by prompting a model to generate responses using varied reasoning patterns, as in the sketch after this list.
  • Alternative Communication Methods: Exploring structured formats like JSON or latent spaces for agent communication could optimize information sharing.
  • Optimizing Multi-Agent Coordination: Further research is needed to refine how agents collaborate to maximize system-wide performance.
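
One way the single-model adaptation mentioned above might look in practice is sketched below: a single model is prompted with several reasoning styles that stand in for distinct agents, and a majority vote over the styled answers plays the role of the multi-agent consensus. The `REASONING_STYLES` prompts and the `call_model` wrapper are illustrative assumptions, not part of the paper.

```python
from collections import Counter

# Hypothetical reasoning styles standing in for distinct agents.
REASONING_STYLES = [
    "Solve the problem step by step with careful algebra.",
    "Solve the problem by first estimating the answer, then refining it.",
    "Solve the problem by checking each candidate answer in turn.",
]

def single_model_consensus(call_model, problem):
    """`call_model(prompt)` wraps a single LLM; each style plays an agent's role."""
    answers = [call_model(f"{style}\n\nProblem: {problem}") for style in REASONING_STYLES]
    # A majority vote over the styled answers mirrors the multi-agent consensus step.
    return Counter(answers).most_common(1)[0][0]
```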

Conclusion

The multi-agent fine-tuning framework represents a significant advancement in LLM self-improvement, fostering sustained performance gains by harnessing diversity and specialization. By enabling models to iteratively learn from a team-based approach rather than a single iterative loop, this method opens the door to more robust, generalizable, and continually improving AI systems.