Replies: 36 comments 39 replies
-
We did talk about performance optimization in the past, and a Mesa performance optimization project might be interesting this year. Mesa's scalability to millions of agents depends on efficient core operations in AgentSet, spatial grids, and event scheduling. This project could systematically identify and address performance bottlenecks across the library.

The first phase involves comprehensive profiling of Mesa's example models (Boltzmann Wealth, Schelling, Wolf-Sheep, Flocking) using tools like cProfile, py-spy, and memory_profiler to create a performance baseline and identify hotspots. Likely candidates include AgentSet operations.

The second phase could explore optimization strategies: expanding NumPy vectorization for batch agent operations, restructuring data layouts for cache efficiency, and evaluating Rust acceleration via PyO3 for compute-intensive components like spatial indexing, large-scale shuffling, and event queue management. Rust is particularly promising for operations with clear data boundaries (grids, coordinate math) where Python object overhead can be avoided.

Deliverables could include a reproducible low-level benchmarking suite and documented performance improvements with before/after comparisons.

@Ben-geo @adamamer20 I'm also curious if there are lessons or techniques from mesa-frames transferable to the main library.
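As a starting point for the profiling phase, a harness like the one below could be wrapped around each example model. This is a hedged sketch: `run_model` here is a toy stand-in for a real Mesa model's step loop, so the snippet runs without Mesa installed.

```python
# Minimal profiling harness sketch. Assumption: run_model is a stand-in
# for a real Mesa model run, so this executes without Mesa installed.
import cProfile
import io
import pstats
import random

def run_model(n_agents=1000, steps=50):
    # Toy stand-in for a Mesa step loop: shuffle agents, transfer wealth.
    agents = [{"wealth": 1} for _ in range(n_agents)]
    for _ in range(steps):
        random.shuffle(agents)          # analogous to AgentSet.shuffle_do
        for a in agents:
            if a["wealth"] > 0:
                other = random.choice(agents)
                other["wealth"] += 1
                a["wealth"] -= 1
    return agents

profiler = cProfile.Profile()
profiler.enable()
run_model()
profiler.disable()

# Report the top hotspots by cumulative time.
buf = io.StringIO()
pstats.Stats(profiler, stream=buf).sort_stats("cumulative").print_stats(5)
report = buf.getvalue()
```

The same pattern, pointed at real example models and combined with py-spy flame graphs, would produce the baseline the first phase calls for.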
-
These project ideas look great! I'm really interested in getting involved with Mesa for GSoC 2026. The Behavioral Framework project really caught my attention. I find it fascinating how individual agents making their own decisions can lead to complex emergent behaviors in the system. What excites me most is the challenge of taking these theoretical behavioral models - things like BDI or needs-based architectures - and turning them into something practical that people can actually use. It's basically building the "brain" for agents, which is pretty cool.
One question: which existing Mesa models would you recommend looking at to see how people currently work around things like time-consuming tasks, competing priorities, or continuous state changes? I'd love to understand the current pain points from real examples.
-
Hi @EwoutH, exciting list! Building on the "Behavioral Framework" and "Performance" ideas, I'd like to propose a distinct direction for 2026 (or potential experiments in 2025): "Generative Agents Integration" (LLM-driven behavior). While the Behavioral Framework (BDI/GOAP) is excellent for deterministic, rule-based complexity, there is a growing need for agents that can reason dynamically via LLMs (inspired by the "Generative Agents" architecture by Park et al.).

I believe this complements the traditional Behavioral Framework by offering a probabilistic alternative for scenarios where defining explicit rules is too difficult (e.g., natural language negotiation between agents or simulating social dynamics). I've already started experimenting with this using the new Mesa 3.0 syntax and ensuring strict JSON outputs. I would love to explore this further as a potential project area.
-
Thanks for pointing that out! Yes, I have explored it. My proposal is essentially to modernize and expand that initiative for the Mesa 3.0 era.
-
I have two more: making clean-sheet designs of how we run models in experimental setups and how we collect data.

**Reimagining Model Execution and Experimental Setup**

The goal isn't to design a specific solution but to map the territory: What capabilities should Mesa provide natively versus enable through documented patterns? Where are the natural architectural boundaries between experiment specification, execution strategy, and result aggregation? How do we balance simplicity for beginners running local parameter sweeps against power users orchestrating thousands of replications across HPC resources? What standards exist for experiment metadata and provenance that Mesa should adopt? The project could start with a comprehensive requirements analysis, ecosystem survey, user research findings, and architectural principles that can guide Mesa 4.0's design. And if time allows, a (proof of concept) implementation.
**Rethinking Data Collection and Management**
The research phase should investigate: What data patterns do ABM researchers need to capture, and how do current workflows break down? What can we learn from how climate models, computational biology, and other simulation-heavy domains handle output management? Where does the impedance mismatch lie between ABM data patterns and standard scientific data formats? How do users currently bridge Mesa outputs to their analysis tools, and what friction exists? What performance characteristics matter most: collection overhead during simulation, storage efficiency, or query speed during analysis?

The contributor should survey existing solutions, prototype integration patterns with ecosystem tools, and identify fundamental architectural questions: Should Mesa embrace lazy evaluation and streaming? How can we handle the "collect everything vs. collect strategically" tension? What's the right balance between flexibility and performance? The outcome should include a clear problem taxonomy, evaluation of how existing tools address (or don't address) ABM-specific needs, user requirements gathered from community research, and architectural recommendations for Mesa 4.0.

Both projects emphasize discovery over delivery: understanding the landscape, learning from successes and failures elsewhere, and establishing principled foundations rather than rushing to implementation. Once we know what we need to build and which tools we have to build it, the implementation becomes relatively easy.
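To make the experiment-specification boundary concrete, here is a hypothetical sketch of what separating the spec from the execution strategy could look like; `ExperimentSpec` and `run_experiment` are illustrative names, not existing Mesa API.

```python
# Hypothetical sketch: experiment specification kept separate from the
# execution strategy. All names here are illustrative, not Mesa API.
from dataclasses import dataclass
from itertools import product

@dataclass
class ExperimentSpec:
    parameters: dict           # name -> list of values to sweep
    replications: int = 1      # repeated runs per parameter point
    seed: int = 42

    def design_points(self):
        # Cartesian product of parameter values, repeated per replication.
        names = list(self.parameters)
        for values in product(*self.parameters.values()):
            point = dict(zip(names, values))
            for rep in range(self.replications):
                yield {**point, "replication": rep}

def run_experiment(spec, model_fn):
    # Execution strategy lives here: a serial loop in this sketch, but the
    # same spec could feed multiprocessing or an HPC job array.
    return [model_fn(**pt) for pt in spec.design_points()]

spec = ExperimentSpec(parameters={"density": [0.3, 0.5], "vision": [1, 2]},
                      replications=2)
results = run_experiment(spec, lambda density, vision, replication:
                         {"density": density, "vision": vision})
```

The point of the split is that the spec is pure data (easy to serialize as experiment metadata/provenance), while the runner is swappable.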
-
Since there is huge interest in doing things with ML/DL/RL/AI, here is something that might be interesting: a concrete research idea is to build a "Mesa Inference" layer that turns any Mesa model into a calibratable scientific simulator: users specify a parameter prior, an observation model (what macro data they have), and an extraction function that maps each simulation run into a vector of macro summaries (or a learned embedding). Mesa already provides a natural home for this through its core model loop, data collection, and analysis workflow, so the research contribution is primarily about standardizing the simulator ↔ inference boundary. The immediate goal is posterior inference over micro-parameters given macro observations, i.e., identifying which micro behaviors are consistent with the observed macro phenomenon, using modern likelihood-free inference methods. This directly aligns with the Mesa ecosystem and its focus on reproducible ABM experimentation.

Methodologically, the first integration target could be simulation-based inference (SBI) as a default backend. This would involve a Mesa-to-SBI adapter that exposes the model as a function mapping sampled parameters to a vector of summary statistics.

A second, more exploratory research track is to support gradient-assisted calibration for differentiable or partially differentiable ABMs implemented in Mesa. The idea is to provide an alternative execution mode for models whose agent transitions or aggregation steps are differentiable (or can be relaxed), enabling gradient-based optimization and sensitivity analysis alongside traditional simulation. The deliverable could be a "differentiable Mesa kernel" for a restricted subset of schedulers and spaces, coupled to automatic differentiation frameworks, with seamless fallback to standard stochastic simulation for general models.
This direction builds on prior demonstrations that differentiable ABMs can significantly accelerate calibration, such as Kotthoff's work in JASSS on gradient-based calibration of ABMs (Kotthoff, 2022), and would position Mesa at the frontier of scientific machine learning for agent-based modeling.

Finally, the framework should explicitly support partitioning variance between hypothesized structure and unexplained effects, enabling a form of hypothesis evaluation rather than mere fitting. This can be done via nested model comparisons: (i) ABM-only predictions of macro outcomes, (ii) ABM plus a flexible residual model that captures systematic deviations, and (iii) a purely black-box benchmark. The relative out-of-sample performance of these models quantifies how much variance is explained by the ABM's micro-level hypothesis versus residual structure. Within an SBI or ABC framework, this analysis naturally incorporates parameter uncertainty and allows posterior predictive checks to diagnose model misspecification. Reporting cross-validated error reductions, posterior predictive discrepancies, and mechanism-level ablations would make the explanatory contribution of the ABM explicit and measurable.
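To illustrate the simulator ↔ inference boundary without pulling in an SBI library, here is a minimal rejection-ABC sketch under stated assumptions: `simulate` is a toy stand-in for a Mesa run that maps a micro-parameter to macro summaries, and everything else is plain NumPy.

```python
# Hedged sketch of the simulator <-> inference boundary using simple
# rejection ABC (not the sbi package). simulate() is a toy stand-in for
# a Mesa model run; all names are illustrative.
import numpy as np

rng = np.random.default_rng(0)

def simulate(theta, n=500):
    # Micro-parameter theta drives agent outcomes; we return macro
    # summaries (mean and std), mirroring the extraction function idea.
    outcomes = rng.normal(loc=theta, scale=1.0, size=n)
    return np.array([outcomes.mean(), outcomes.std()])

def abc_rejection(observed, prior_draws, epsilon):
    # Keep parameter draws whose simulated summaries land near the data.
    accepted = []
    for theta in prior_draws:
        summary = simulate(theta)
        if np.linalg.norm(summary - observed) < epsilon:
            accepted.append(theta)
    return np.array(accepted)

observed = simulate(2.0)                  # pretend this is empirical data
prior = rng.uniform(-5, 5, size=2000)     # flat prior over theta
posterior = abc_rejection(observed, prior, epsilon=0.2)
```

An SBI backend would replace the rejection loop with a learned posterior estimator, but the `simulate(theta) -> summaries` contract it needs from Mesa is the same.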
-
Hi everyone, I'm Briony!
-
Hi everyone, I am Joel Jose. I have prior experience working with CNN-based models and I’m particularly interested in how AI systems perceive visual information. I’m exploring the idea of introducing a minimal perception abstraction for Mesa agents. The concept would start very small: for example, agents could receive simple visual inputs, like colors or local spatial patterns, converted into numerical observations. Over time and through incremental development, agents could learn to interpret more complex observations, potentially evolving toward recognizing structured patterns or small images. The goal would be to implement this as a modular, optional extension that doesn’t affect Mesa’s core scheduler or deterministic execution, while allowing experimentation with perception-based learning in agent-based simulations. I’d love feedback from the maintainers on whether this direction aligns with Mesa’s road map or if there are existing discussions or abstractions I could build on.
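As a rough illustration of what such a minimal perception abstraction could look like (these names are hypothetical, not an existing Mesa API), a pure function could map a cell's local neighborhood to a numeric observation vector:

```python
# Illustrative sketch (not an existing Mesa API): turn a cell's local
# neighborhood into a flat numeric observation vector an agent could
# learn from. COLOR_CODES and observe() are invented names.
import numpy as np

COLOR_CODES = {"empty": 0.0, "red": 1.0, "blue": 2.0}

def observe(grid, x, y, radius=1):
    """Return the Moore neighborhood around (x, y) as a flat float vector,
    padding out-of-bounds cells with the 'empty' code."""
    h, w = len(grid), len(grid[0])
    obs = []
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            nx, ny = x + dx, y + dy
            if 0 <= nx < w and 0 <= ny < h:
                obs.append(COLOR_CODES[grid[ny][nx]])
            else:
                obs.append(COLOR_CODES["empty"])
    return np.array(obs, dtype=float)

grid = [["red", "blue", "empty"],
        ["empty", "red", "blue"],
        ["blue", "empty", "red"]]
vec = observe(grid, 1, 1)  # 3x3 Moore neighborhood -> 9 values
```

Because this is a pure function from grid state to an array, it stays outside the scheduler and deterministic execution path, which matches the "optional extension" goal.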
-
**Reviving Mesa-examples**

After moving our core examples to the main repo (see #2330 (comment) and #2364), mesa-examples got a bit abandoned. We stopped extensive testing in the main Mesa CI (we now run that on our core examples). Defining, reviving and formalizing mesa-examples could be really useful. A proposal on this topic could answer (one or more) questions like:

This is my initial take; any other visions/ideas are welcome on this!
-
Hi @EwoutH, thanks for laying out these ideas so clearly, especially the repeated emphasis on building models first before committing to new abstractions. One direction I’d like to explore (as a contributor, not as a framework proposal yet) is using new example models as a diagnostic tool for the Behavioral Framework discussion.

Concretely, instead of starting from BDI / task systems / APIs, I plan to: design a small but non-trivial example model that is not currently present in Mesa, implement it using only existing Mesa primitives, and use that experience to surface where current patterns feel natural versus where behavior logic becomes awkward or overly tangled. The type of behavior I have in mind is adaptive, experience-based decision-making within a single run: agents choose between alternatives (e.g. risky vs safe actions), remember negative outcomes locally, and adjust future decisions based on that memory.

This is intentionally below a full behavioral framework: no new APIs, no new abstractions, just a clean example plus a README that documents the pain points encountered. My goal is to contribute this as a new example, and only then reflect on what it teaches us about states, decisions, action duration, or task structure, feeding that insight back into the broader Behavioral Framework discussion. If this example-first, evidence-driven approach is useful, I’d be happy to iterate further and expand it based on feedback.
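A toy sketch of the adaptive, experience-based behavior described above, using only plain Python and no new abstractions (all names are illustrative):

```python
# Sketch of experience-based choice with plain Python: an agent picks
# between a safe and a risky action and down-weights actions that
# produced negative outcomes. Illustrative names throughout.
import random

class AdaptiveAgent:
    def __init__(self, seed=0):
        self.rng = random.Random(seed)
        self.memory = {"safe": 0.0, "risky": 0.0}  # running outcome averages
        self.counts = {"safe": 0, "risky": 0}

    def choose(self, epsilon=0.1):
        # Mostly exploit remembered outcomes, occasionally explore.
        if self.rng.random() < epsilon:
            return self.rng.choice(["safe", "risky"])
        return max(self.memory, key=self.memory.get)

    def learn(self, action, outcome):
        # Incremental running average of observed outcomes per action.
        self.counts[action] += 1
        n = self.counts[action]
        self.memory[action] += (outcome - self.memory[action]) / n

agent = AdaptiveAgent()
for _ in range(200):
    action = agent.choose()
    # Risky action: high variance, negative on average; safe: small positive.
    outcome = agent.rng.gauss(-0.5, 2.0) if action == "risky" else 0.1
    agent.learn(action, outcome)
```

Even this tiny version already touches the pain points mentioned (per-agent memory, state that persists across steps), which is exactly what the example model would document.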
-
Hi everyone, I'm Liyue. Based on my research needs and the recent community discussions on Scenarios (#3176), I'm exploring the idea of mesa-calibration. It aims to provide a standardized interface for automated parameter optimization and validation against empirical data. I have posted the technical proposal in a separate discussion here: #3207. I would love to hear any feedback or feature requests from the community!
-
**Modernizing Mesa-Geo: Refactor and Visualization Overhaul**

Goal: Refactor Mesa-Geo to align with Mesa's modern architecture, moving from object-based raster storage to high-performance vectorized property layers.

Phase 1: Core Data Refactor
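A small sketch of the motivation for the vectorized refactor: per-cell Python objects force a Python-level loop, while a NumPy-backed layer updates the whole raster in one call. The names here are illustrative, not the Mesa-Geo API.

```python
# Hedged illustration of object-based vs vectorized raster storage.
# Cell and the elevation layer are illustrative, not Mesa-Geo API.
import numpy as np

class Cell:                      # object-based raster cell
    def __init__(self, elevation):
        self.elevation = elevation

# Object-based storage: 200x200 cells, updated one by one in Python.
cells = [[Cell(float(i + j)) for j in range(200)] for i in range(200)]
for row in cells:
    for cell in row:
        cell.elevation *= 1.01   # e.g. uniform uplift

# Vectorized storage: the same raster as a single array, one update call.
elevation = np.add.outer(np.arange(200.0), np.arange(200.0))
elevation *= 1.01
```

Both representations end up with identical values; the array version simply removes the per-cell Python object overhead, which is the point of the Phase 1 refactor.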
-
Hi @EwoutH, thanks for starting and actively guiding this discussion. I really appreciate the repeated emphasis on understanding real modeling pain points by building and extending concrete Mesa models before introducing new abstractions.

I plan to spend the coming weeks working through existing examples (such as Wolf–Sheep, Schelling, and Sugarscape), experimenting with small extensions or new example models that stress areas like adaptive decision-making, memory, and multi-step actions. The goal is to document where current Mesa primitives feel expressive versus where behavior logic becomes awkward or overly tangled.

I also find the Mesa Inference direction very exciting, particularly the idea of formalizing the simulator–inference boundary and supporting SBI/ABC workflows for calibrating ABMs against observed macro data. I’ll aim to contribute incrementally through examples, documentation, or small PRs, and share concrete findings back into this discussion rather than proposing abstractions prematurely. Looking forward to learning from and contributing to the Mesa community.
-
Hey @EwoutH, the project ideas and descriptions have been really informative. I’ve used Mesa agents before, and the meta-agent project really caught my attention. I’ve gotten started as instructed in the document, and I was wondering if there is a separate discussion thread for the meta-agent project. Also, are there any current issues I could work on?
-
Hi everyone, I’m planning to submit a GSoC 2026 proposal for the "Mesa-LLM iteration to push to production" project. I’m a recent Computer Science graduate with experience in AI/ML systems, large language models, and agent-based AI architectures. Recently, I’ve been working on an agentic AI system where LLMs interact with external tools and APIs to perform multi-step reasoning tasks, which sparked my interest in exploring how similar approaches could be applied to agent-based simulations using Mesa.

I’ve started going through the Mesa documentation and exploring the mesa-llm repository to understand how LLM reasoning, memory handling, and API integrations are currently implemented. Before finalizing my proposal, I wanted to ask: are there specific issues in the mesa-llm repository that would be good entry points for contributors?

Looking forward to learning more about the Mesa ecosystem and contributing. Thanks!
-
Hi @EwoutH, I’ve started exploring the mesa-llm repository to better understand how LLM-driven agents integrate with the Mesa framework. While going through the repository and documentation, I noticed a small issue in the README structure and opened a PR to improve it. The PR fixes a duplicated section and improves the readability of the supported LLM providers section.

Next, I’m planning to run and analyze the mesa-llm examples to better understand how reasoning, memory, and communication modules interact with Mesa agents. I’m particularly interested in the "Mesa-LLM iteration to push to production" project and exploring areas like improving documentation, testing coverage, and modularization of LLM tools. Looking forward to contributing more and learning from the community!
-
Hi @EwoutH and @quaquel! I'm Nouman Hameed, a BSCS student from Pakistan. I've been following the discussion on the Behavioral Framework. Since I have a strong background in Scikit-learn, I'm interested in implementing 'Learning Agents' that use classic ML models (like Random Forest or SVM) to drive decision-making. My goal is to show how we can replace bulky 'if-else' logic in models like Sugarscape or Wolf-Sheep with trained Sklearn estimators. I’ve already started looking at the mesa-examples to identify where these ML baselines would be most effective. Does this 'ML-as-a-Brain' approach align with what you're looking for in the Behavioral Framework project?
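A hedged sketch of what "ML-as-a-Brain" could look like: a scikit-learn classifier trained on synthetic data standing in for hand-written if-else rules. The features, labels, and `decide` helper are invented for illustration.

```python
# Sketch of the ML-as-a-Brain idea: a trained scikit-learn classifier
# replaces hand-written if-else movement rules. The feature names,
# labels, and decide() helper are invented for this illustration.
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)

# Synthetic training data: [energy, prey_nearby, pack_size] -> action,
# generated from a simple rule the forest should recover: hunt when prey
# is nearby and energy is low, otherwise rest.
X = rng.uniform(0, 1, size=(500, 3))
y = ((X[:, 1] > 0.5) & (X[:, 0] < 0.6)).astype(int)  # 1 = hunt, 0 = rest

brain = RandomForestClassifier(n_estimators=50, random_state=0).fit(X, y)

def decide(energy, prey_nearby, pack_size):
    # An agent's step() could call this instead of nested if-else logic.
    return "hunt" if brain.predict([[energy, prey_nearby, pack_size]])[0] else "rest"

action = decide(energy=0.2, prey_nearby=0.9, pack_size=0.5)
```

In a real model the training data would come from logged simulation traces or empirical behavior rather than a synthetic rule.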
-
Hi @EwoutH and @quaquel, I've spent the past few days building up to the project from first principles: 5 models in Mesa 3.x (Schelling → Wolf-Sheep → Wealth/NetworkGrid → BDI prototype → Q-learning RL agent), with a running log of 15 pain points encountered along the way. The BDI work includes a reusable BDIAgent base class with a full perceive → deliberate → execute cycle. I have pushed it from my own repo and added it in the format requested in the forked GSoC Learning Space.

Key finding while researching: mesa.experimental.actions already covers part of pain point #9 (multi-step plans). My proposal angle is to build the BDI deliberation layer on top of the Actions system rather than replacing it: beliefs, desire selection, and replanning logic that integrates with what's already there.

I've also opened two discussions on pain points not yet tracked: partial observability in grid spaces, and IntentionQueue semantics. Happy to hear whether those are in or out of scope for this project.
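For readers following along, a minimal framework-free version of the perceive → deliberate → execute cycle might look like this (class and method names are illustrative, and the sketch deliberately avoids depending on Mesa or mesa.experimental.actions):

```python
# Minimal, framework-free sketch of a perceive -> deliberate -> execute
# cycle. Illustrative names; not the BDIAgent class from the linked repo.
class BDIAgent:
    def __init__(self):
        self.beliefs = {}           # agent's model of the world
        self.desires = []           # (priority, goal, precondition) tuples
        self.intention = None       # currently committed goal

    def perceive(self, observation):
        self.beliefs.update(observation)

    def deliberate(self):
        # Commit to the highest-priority desire whose precondition holds.
        viable = [(p, goal) for p, goal, cond in self.desires
                  if cond(self.beliefs)]
        self.intention = max(viable)[1] if viable else None

    def execute(self):
        return self.intention       # a real agent would enact a plan here

    def step(self, observation):
        self.perceive(observation)
        self.deliberate()
        return self.execute()

agent = BDIAgent()
agent.desires = [
    (2, "flee", lambda b: b.get("predator_near", False)),
    (1, "eat",  lambda b: b.get("hungry", False)),
]
action = agent.step({"hungry": True, "predator_near": True})
```

In the proposal's framing, `execute` is where the deliberation layer would hand off to the existing Actions system instead of returning a bare label.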
-
Hi! I'm Yogin, a 2nd year undergrad interested in the mesa-examples revival project for GSoC 2026. I spent a few days exploring the examples repo and noticed there are zero continuous space examples (the README literally says "No user examples available yet" for that category). So I went ahead and built one: a Brownian particle diffusion model using mesa.experimental.continuous_space, and just opened a PR. The model has particles doing random walks with soft repulsion so they spread out naturally, and colours them based on local density using a plasma colormap. Pretty simple, but it shows the full setup: ContinuousSpaceAgent, SolaraViz, DataCollector, Scenario params. For the actual GSoC project I'm thinking about focusing on:

Happy to discuss scope. Is there a particular part of the revival that's highest priority from the team's side right now?
-
Hi! I'm Alaa, a CS student from Egypt at New Mansoura University, specializing in Artificial Intelligence. I found Mesa through the GSoC 2026 website and I'm really interested in the Mesa Examples Revival project.
-
Update: I successfully ran the forest_fire example locally with Mesa 3.5.1! I can see the fire spreading and tested different tree density parameters. Looking forward to exploring more examples.
-
Thank you for the guidance! I have already explored the documentation and ran the forest_fire example locally. I found two issues: matplotlib is missing from the dependencies, and the code produces deprecation warnings about Mesa 4.0. I am now reviewing more examples to document similar issues before writing my proposal.

On Tue, Mar 24, 2026, Nitesh verma wrote:

> I think it would be helpful to explore the documentation and review a few example implementations before starting your contribution.
-
Hey everyone! I'm Meer.
-
Hi @colinfrisch, @jackiekazil, I’ve been exploring Mesa-LLM and identified two user-facing issues that seem important for usability and scalability. First, LLM outputs are recomputed on every run, which makes repeated simulations slow for local models, costly for API-based models, and difficult to reproduce or reuse offline. Second, there’s currently limited visibility and control over LLM usage (calls, tokens, latency, cost), which can make larger simulations harder to manage and potentially expensive.

To address these, I’m thinking of working on two related features: record/cache/replay support for LLM outputs, and built-in usage tracking with budget controls. My goal is to keep both solutions simple and practical so they improve developer experience without adding too much complexity. I’d appreciate your thoughts on whether this direction aligns with the project goals.

I would like to structure my proposal around these two core execution-layer issues in Mesa-LLM. I think these are both real and impactful problems that affect reproducibility, local inference workflows, cost-awareness, and overall usability.
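A minimal sketch of the record/cache/replay idea (assuming a hypothetical `LLMCache` wrapper, not the mesa-llm API): responses are keyed by a hash of the prompt and call parameters, so repeated runs replay recorded outputs instead of hitting the backend.

```python
# Illustrative record/replay cache for LLM calls. LLMCache and
# fake_backend are hypothetical names, not part of mesa-llm.
import hashlib
import json

class LLMCache:
    def __init__(self):
        self.store = {}          # key -> recorded response
        self.calls = 0           # how often the backend was actually hit

    def _key(self, prompt, **params):
        # Deterministic key over prompt plus call parameters.
        payload = json.dumps({"prompt": prompt, **params}, sort_keys=True)
        return hashlib.sha256(payload.encode()).hexdigest()

    def generate(self, backend, prompt, **params):
        key = self._key(prompt, **params)
        if key not in self.store:
            self.calls += 1      # cache miss: query the backend once
            self.store[key] = backend(prompt, **params)
        return self.store[key]

# A stand-in "backend" so the sketch runs without any LLM provider.
def fake_backend(prompt, temperature=0.0):
    return f"echo: {prompt}"

cache = LLMCache()
first = cache.generate(fake_backend, "move north?", temperature=0.0)
second = cache.generate(fake_backend, "move north?", temperature=0.0)
```

The `calls` counter is also the seed of the second feature: the same wrapper is the natural place to accumulate token counts, latency, and cost against a budget.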
-
Hi @jackiekazil, @colinfrisch, and @EwoutH! I am Nazeef Danladi Adamu, a postgraduate student from Nigeria applying for the Mesa-LLM Stabilization project (175 hours). My background is in building AI orchestration systems. I recently built research-dossier-app, a full-stack tool that coordinates multiple LLM capabilities through a directive-execution architecture. The core challenge in that project was making LLM calls reliable enough to embed inside an automated pipeline where a silent hang or an unexpected crash breaks everything downstream. That is exactly what this project addresses at the framework level, which is why it caught my attention immediately. Before writing my proposal I read through the mesa-llm 0.3.0 source code and found five concrete issues:
I posted detailed findings with proposed fixes in mesa-llm/discussions/269. My full proposal covers all five stabilization issues, a Mesa 4.0 migration path with version-conditional imports, and example models drawn from scientific literature that demonstrate LLM agents alongside standard agents. Happy to answer any questions or share the proposal draft before the deadline. Thank you for the work on this project.
-
Hi, I'm Farseen Shaikh, applying for the Behavioral Framework project. I started with the recommended starter task: a needs-based Wolf-Sheep extension with continuous internal states (hunger, fear, fatigue) driving priority-based decisions. PR: mesa/mesa-examples#456. Friction points documented in #2538. Plan: build 2–3 models (needs-based ✓, RL-based next, optionally BDI), document friction, compare with GAMA's gymnasium wrapper, and propose improvements justified by implementation experience.
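A tiny sketch of the needs-based pattern (illustrative names, not the PR's actual code): continuous internal states drift each step and the agent acts on the most urgent one.

```python
# Sketch of needs-based priority decisions: continuous internal states
# (hunger, fear, fatigue) drift each step, and the agent acts on the
# most urgent need. Names and dynamics are illustrative.
class NeedsAgent:
    def __init__(self):
        self.needs = {"hunger": 0.2, "fear": 0.0, "fatigue": 0.1}
        # Action that addresses each need once it wins the priority contest.
        self.actions = {"hunger": "graze", "fear": "flee", "fatigue": "rest"}

    def update(self, predator_near=False):
        self.needs["hunger"] += 0.05            # grows steadily each step
        self.needs["fatigue"] += 0.02
        self.needs["fear"] = 1.0 if predator_near else self.needs["fear"] * 0.5

    def act(self):
        urgent = max(self.needs, key=self.needs.get)
        self.needs[urgent] = 0.0                # acting satisfies the need
        return self.actions[urgent]

sheep = NeedsAgent()
sheep.update(predator_near=True)
action_under_threat = sheep.act()               # fear dominates
sheep.update(predator_near=False)
calm_action = sheep.act()
```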
-
Hi everyone! I'm Yadnyesh Awasarkar, a student from India interested in the Mesa Examples Revival project for GSoC 2026.
-
We appreciate all the submissions for GSoC. We are now in review mode, so I am going to close this thread. @EwoutH feel free to reopen if you think we still need it open.
-
The idea list with committed mentors is prepared here. See also Mesa's Google Summer of Code 2026 guide.
Let's start the discussion on 2026 GSoC ideas!
2025 ideas can be found here. Ones leftover from last year:
- Integrating the [mesa-geo](https://github.com/projectmesa/mesa-geo) package directly into the core Mesa library as a `mesa.geo` module, resolving compatibility issues arising from their separate evolution and simplifying dependency management. By leveraging Mesa's new experimental cell and continuous space architectures, the project will create a unified spatial modeling framework that supports GIS functionality, coordinate transformations, and standard file formats like GeoJSON within a consistent API. The consolidation aims to make advanced geospatial modeling a first-class feature, ensuring that property layers and spatial visualizations work seamlessly across all Mesa projects.

What more ideas and ambitions do we have?