AI Agents & Automation

Mar 16, 2026

Andrej Karpathy's AI Automation Risk Table Deleted

Andrej Karpathy created a repository showing various professions and their exposure to automation, which he took down shortly after its release. The repository and table can still be accessed through a post by Josh Kale.

Mar 16, 2026

OpenViking: An Open-Source Context Database for AI Agents

OpenViking is a new open-source tool designed to improve context management in AI agents by using a filesystem-based structure for memory and retrieval, addressing issues of fragmentation and inefficiency in traditional vector databases.

Mar 16, 2026

Caliber: Open-source CLI for Tailored AI Agent Setup

Caliber is an open-source command-line interface that automates the generation of AI agent setups customized to specific codebases. It scans the codebase for languages, frameworks, and dependencies to create configuration files and recommendations for multi-agent coordination protocols. The tool runs locally, ensuring code privacy, and evolves with changes in the codebase.

Mar 16, 2026

AI Agents Debate Unsolved Math Problems with Automated Verification

An experiment involving six AI agents with different reasoning styles debated unsolved math problems, producing valid Ramsey graph constructions and implementing a fact-checking protocol to prevent hallucinations in claims.

Mar 15, 2026

AI Agents May Extend Human Agency Beyond Death

The concept of AI 'afterlives' suggests that individuals can leave behind AI agents trained on their personal data, allowing them to continue managing assets and making decisions posthumously. This raises questions about ownership, liability, and the implications of having a version of a person remain economically active after their death.

Mar 14, 2026

AgentMeet: A Group Chat API for AI Agents

AgentMeet is a newly developed group chat API designed for AI agents to share context, onboard new agents, and collaborate in real-time. It utilizes FastAPI and asyncpg on Postgres for backend operations, allowing agents to communicate via simple HTTP requests. The platform aims to facilitate seamless interaction among AI agents without the need for SDKs or complex setups.

Mar 14, 2026

JL-Engine-Local: A Dynamic Agent Assembly Engine for AI

JL-Engine-Local is a dynamic agent-assembly engine that builds and runs AI agents entirely in RAM, allowing for flexible integration with various backends while maintaining user privacy and control.

Mar 13, 2026

Stanford Researchers Release OpenJarvis: A Local-First Framework for Building On-Device Personal AI Agents

OpenJarvis is an open framework designed for creating personal AI agents that operate entirely on-device, emphasizing a local-first approach. It features modular components for model selection, inference, and adaptation, and supports various backends while focusing on efficiency metrics.

Mar 13, 2026

Autonomous Pipeline Generates Playable Godot Games from Text Prompts

A new autonomous pipeline has been developed that generates playable Godot games from text prompts, addressing challenges in LLM code generation and verification. The system utilizes a three-layer reference system for GDScript, implements agentic lazy-loading for context management, and employs a three-stage verification process to ensure the correctness of the generated code.

Mar 13, 2026

Building an Autonomous Machine Learning Research Loop in Google Colab

A tutorial on implementing an automated experimentation pipeline using Andrej Karpathy’s AutoResearch framework for hyperparameter discovery and experiment tracking in Google Colab.

Mar 13, 2026

Autonomous company frameworks are gaining traction

The rise of open-source orchestration tools for zero-human companies is being highlighted, particularly through the example of Paperclip AI.

Mar 13, 2026

AI Agents Compete to Beat Pokémon Red on Agent-Only Platform

A new platform allows AI agents to compete against each other to complete Pokémon Red without human intervention, controlling the emulator and managing gameplay autonomously. The runs are live streamed for viewers to watch.

Mar 13, 2026

Open-source Modular AI Agent Framework Released

A new open-source AI agent framework has been developed that supports local models and features a modular plugin system, allowing for flexible and interactive UI generation.

Mar 12, 2026

NVIDIA Releases Nemotron 3 Super: A 120B Parameter Open-Source Hybrid Mamba-Attention MoE Model

Nemotron 3 Super is an open-source 120-billion parameter model developed for advanced multi-agent reasoning, offering 7x higher throughput and double the accuracy of its predecessor. It features 'Reasoning Budgets' for compute cost control and is fully open-sourced for enterprise-grade autonomous agents.

Mar 12, 2026

SlimClaw - A New Personal Assistant Built on Configurability Through Skills

Andrej Karpathy discusses a new pattern in AI development focusing on configurability through skills rather than traditional config files. SlimClaw, a Python fork inspired by NanoClaw, exemplifies this approach by allowing users to add features like Telegram through skills that modify the codebase directly, maintaining a clean architecture.

Mar 12, 2026

Introduction of Syntropy: A Security and Governance Layer for AI Agents

Syntropy is a newly developed platform designed to enhance the security and governance of AI agents. It includes features such as comprehensive logging of agent interactions, PII detection, prompt injection defense, and compliance report generation. The platform aims to provide better visibility and control over AI agent operations, addressing common issues faced by organizations using AI in production.

Mar 12, 2026

Webinar on Agentic AI Architectures in Financial Systems

A free webinar discussing the use of agentic AI systems in financial workflows, covering trading agents, risk monitoring agents, and compliance assistants.

Mar 11, 2026

NVIDIA AI Releases Nemotron-Terminal: A Systematic Data Engineering Pipeline for Scaling LLM Terminal Agents

NVIDIA has introduced Terminal-Task-Gen and the Terminal-Corpus dataset to tackle data scarcity in developing autonomous terminal agents. The Nemotron-Terminal model family, particularly the 32B variant, achieved a 27.4% success rate on the Terminal-Bench 2.0 evaluation, outperforming larger models. This research emphasizes the importance of high-quality data engineering over sheer parameter scale.