Turning Agents Into Sharable Software: Inside Docker’s cagent A small utility that feels like a platform: Docker's cagent turns AI agents into sharable, runnable software. Teams often build a clever assistant or a small cluster of cooperating agents. Then real-w... AI agents Docker Go MCP Open Source Reproducibility
Paper2Agent: Transforming Research Papers into Interactive AI Agents Research papers are traditionally require readers and reviewers to interpret code, methods, and results independently. Paper2Agent aims to transform published research into interactive AI agents allow... AgentScope AI agents AutoGen Azure AI Agents Claude Code Code2MCP computational biology LangGraph MCP NotebookLM OpenAI Assistants OpenDevin open source reproducibility Stanford tutorial extraction
Smolagents: Code-first, Multi-step AI agents smolagents is Hugging Face 's small, intentionally minimal Python library for building agentic systems that think in code. It gives you a multi-step loop, tools, and models without a forest of abstrac... AI agents CodeAgent Hugging Face MCP Open Source smolagents
Agentic Web Interfaces: Building Websites That Welcome AI Agents The paper "Build the web for agents, not agents for the web" proposes a reorientation of how we design the modern web: rather than forcing AI systems to operate through human-centered pages, we should... AWI MCP Playwright Safety web agents WebArena
The Model Context Protocol Registry: Building the Backbone for AI Server Discovery The Model Context Protocol Registry is an open, standards-driven catalog and API for MCP servers. If you are building or running AI tools that speak MCP, the registry is the connective tissue that tur... AI registry API DevOps Enterprise Go Kubernetes MCP Open source Package validation Subregistries
Context Engineering: How to Build High‑Signal AI Agents Context Engineering: How to Build High‑Signal AI Agents Context is the new battleground for AI agents. While the focus had been on prompts and models, the real difference between demos and production ... AI agents coding agents compaction context engineering MCP RAG subagents tool design
MCP-Universe: Real-World Benchmarking For Agents That Use MCP The Model Context Protocol (MCP) has quickly become a common interface for connecting large language models to external tools and data. By design, it looks like a USB-C port for AI applications: a sta... benchmark LLM agents MCP Salesforce AI Research tool use
Claude Code PM: Shipping Better Software with Spec-driven, Parallel Workflows Claude Code PM from Automaze is a pragmatic workflow for turning product specs into epics, GitHub issues, and ultimately production-grade code, all with full traceability. The repository documents a c... Agentic workflows AI engineering Automaze Claude GitHub Issues MCP Project management
DeepCode: An Open Agentic Coding System That Turns Papers Into Code DeepCode from the Data Intelligence Lab at The University of Hong Kong (HKUDS) is an open agentic coding system that turns research papers and natural language requirements into working code. It aims ... agents code-generation HKU MCP Paper2Code Python RAG research-reproduction Streamlit
Agent Client Protocol: Making Agentic Editing Portable The Agent Client Protocol (ACP) is an open effort by Zed Industries to standardize how code editors talk to AI coding agents. If the Language Server Protocol decoupled languages from editors, ACP aims... ACP AI Agents Code Editors JSON-RPC MCP Open Source Rust TypeScript
mcp-agent: Build composable AI agents on MCP mcp-agent from LastMile AI is a Python framework that connects the dots between the low-level Model Context Protocol and high-level, production-friendly agent patterns. It manages MCP server lifecycle... Agents AI Frameworks Anthropic LastMile AI MCP OpenAI Swarm Python
Introducing LiveMCPBench: Evaluating Models on Large Tool Set Usage A new arXiv preprint, LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools , from the Chinese Academy of Sciences and UCAS, introduces a benchmark to test AI agents in realistic tool-rich environme... AI benchmarking AI tools Artificial Intelligence MCP MCP Server