I work at the intersection of cloud infrastructure, agentic AI systems, reinforcement learning, and multi-agent trust. This profile is where the technical and research work lives.
Reinforcement learning
- pokemon-red-ai · Reinforcement learning research on observation representations in Pokémon Red. Pixel, symbolic, and hybrid conditions under capacity-matched encoders. RecurrentPPO via SB3-Contrib.
Agentic systems
- claude-teams-operator · Kubernetes operator that runs Claude Code agent teams as distributed pods.
- ea-agent · AI personal executive assistant built around Obsidian.
Research tooling
- overleaf-mcp · MCP server for editing Overleaf LaTeX projects from Claude. Published to PyPI. Single-user by design, auditable, built for academic researchers.
- paper-skills · Claude Code skills for academic paper triage and Obsidian integration.
Applied experiments
- golf-coach-agent · Vision LLM applied to golf swing analysis.
Identity and tooling
- alanchester-brand · Personal brand system, expressed as code. Tokens, components, the equation as identity.
- engineering-handbook · Personal engineering handbook. Philosophies, workflows, and tooling for how I build software. Versioned with semver.
- mac-dev-setup · One command from zero to productive on macOS.
Research and writing on reinforcement learning, agentic systems, multi-agent trust, and team architectures for the agentic era. Currently drafting a paper for ARLET 2026 (NeurIPS workshop).
The question: in long-horizon reinforcement learning, do symbolic observations beat pixel observations once encoder capacity is properly controlled? Pokémon Red is the empirical environment.
On reinforcement learning. How does observation representation affect sample efficiency in long-horizon reinforcement learning? Pixel versus symbolic versus hybrid observations under capacity-matched encoders are the conditions under study.
On team architecture. How does the unit of work change when teams include autonomous agents, and what structures support variable-elasticity teams?
On multi-agent security. How does trust topology affect security in multi-agent LLM systems? Centralized orchestrator versus peer-authenticated trust.
About fifteen years across kernel engineering, Kubernetes platform development, cloud security, and product leadership. Currently leading a team of product managers at Oracle Cloud Infrastructure.
Dual B.S. in Computer Science and Applied Mathematics, NC State.
MBA candidate, NC State Jenkins Graduate School of Management.
- Brand system · alanchester-brand
- Personal site · alanchester.com (coming)
- LinkedIn · alan-chester
- Email · amcheste@gmail.com
δ ε
