HexStrike AI: Giving LLMs Safe, Auditable Hands in Offensive Security Turning AI agents into red teamers HexStrike AI is an open-source attempt to wire large language model (LLM) agents directly into a real offensive security toolbox. It exposes a local API and Model Co...
HexStrike AI Automated Cybersecurity HexStrike AI is transforming how organizations approach penetration testing and vulnerability intelligence by harnessing the power of advanced security tools through a simple, conversational prompt. B... AI security automation chatgpt cybersecurity tools penetration testing red teaming vulnerability management
SmallThinker: Bringing Powerful Language Models to Local Devices Researchers from Shanghai Jiao Tong University’s Institute of Parallel and Distributed Systems, the School of Artificial Intelligence, and Zenergize AI introduced SmallThinker : a family of large lang... AI Models AI training reinforcement learning
Teaching AI to Teach: Sakana AI's New Approach to Training Reasoning Models Introduction: The High Cost of AI Reasoning Training large language models (LMs) to reason like humans is one of the biggest challenges in AI today. The current state-of-the-art method, reinforcement ...
Introducing LiveMCPBench: Evaluating Models on Large Tool Set Usage A new arXiv preprint, LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools , from the Chinese Academy of Sciences and UCAS, introduces a benchmark to test AI agents in realistic tool-rich environme... AI benchmarking AI tools Artificial Intelligence MCP MCP Server
UI-TARS Desktop: A Native GUI Agent That Operates Your Computer With Language UI-TARS Desktop: A Native GUI Agent That Operates Your Computer With Language UI-TARS Desktop is an open source desktop application that lets you control a computer and browser using plain English ins...
Activepieces: Open Source AI Automation Meets MCP Activepieces is an open source, AI-first automation platform that aims to be a friendly, extensible alternative to closed SaaS workflow tools. Its TypeScript-based "pieces" framework, hundreds of read...
Coroot: Open-Source Observability That Turns Telemetry Into Decisions Coroot is an open-source observability platform that collects metrics, logs, traces, and profiles and then goes a vital step further: it turns raw telemetry into explanations and next steps. The proje...
Anomalib: Visual Anomaly Detection for Industry 4.0 open-edge-platform Organization anomalib An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inferenc...
JSON Crack: Turning Complex Data Into Clickable Diagrams JSON Crack is an open source web app that transforms JSON and other structured data formats into interactive, navigable diagrams. Instead of squinting at curly braces, you get a living map of your dat...
al-folio: A Jekyll Theme That Makes Academic Sites Feel Effortless The alshedivat/al-folio repository is a widely adopted, open-source Jekyll theme designed for academics and research labs. It ships with opinionated layouts for publications, people, proje...
Inside OpenAI's Agents SDK for Python The OpenAI Agents SDK (Python) is a compact, batteries-included toolkit for building agents that can call tools, hand off control, and trace their own reasoning across steps. It aims to make multi-age...