AI Tools | data-centered

Evaluation & Benchmarks

Testing frameworks and benchmarking tools for AI systems.

6 resources

Hover over a resource to preview

Evaluation & Benchmarks 6

├── ADE-Bench (dbt Labs) Github dbt-labs ~ → ├── LLM Council (Karpathy) Github a-karpathy ~ → ├── Origon: Production-Grade AI Agent Platform Origon origon-team ~ → ├── Arize Phoenix: AI Observability & Evaluation Github arize-ai ~ → ├── Better Agents: Standards for Agent Building Github langwatch ~ → └── The Importance of Agent Harness in 2026 Philschmid p-schmid ~ →

Other Tools 11

├── Advanced Tool Use (Anthropic) Anthropic anthropic-team ~ → ├── Consistent Hero Images Workflow Open j-ouyang ~ → ├── CocoIndex: Data Transformation Framework for AI Cocoindex cocoindex-team ~ → ├── Tool Search is Dead, Long Live Skills Nicolaygerold n-gerold ~ → ├── Gemini Deep Research Gemini google ~ → ├── 21st.dev: AI Product Designer 21st 21st-dev ~ → ├── Custom AI Agent Builder's Guide Motherduck motherduck-team ~ → ├── Towards a Disaggregated Agent Filesystem on Object Storage Penberg p-enberg ~ → ├── Agentic Coding Flywheel Setup Github d-marx ~ → ├── Convex AI Chat Template Convex convex-team ~ → └── Workflow DevKit for AI Agents Workflowdevkit workflow-team ~ →

CLI & Utilities 8

├── mdflow - Executable Markdown Github j-lindquist ~ → ├── summarize.sh: Web Content Extraction CLI Summarize p-steinberger ~ → ├── Tigma: Terminal-Based ASCII Design Tool Github j-longster ~ → ├── Agentic Coding Flywheel: VPS Bootstrap System Github j-swannack ~ → ├── Peaky Panes: TUI Project Manager Github regenrek ~ → ├── Tigma: AI-Driven Design Tool Github p-chaganti ~ → ├── PeakyPanes: Cursor Window Manager Github t-reckart ~ → └── Dots: Lightweight Task Tracking Github j-lowin ~ →

Memory Systems 5

├── claude-mem - Persistent Memory Github thedotmack ~ → ├── OpenMemory: Local Memory Store for LLM Apps Github cavira ~ → ├── Semantic Memory: Local Vector Search with PGlite Github j-hooks ~ → ├── Memory Lane: Persistent Memory for Claude Code Gist a-hillman ~ → └── Context Field for Cursor Github gmtstudio ~ →

MCP & Protocols 6

├── MCP Deep Dive: The USB-C Layer for AI Newsletter g-orosz ~ → ├── mcp-use: Full-Stack MCP Framework Github mcp-use-team ~ → ├── Agent Skills: Open Format for Agent Capabilities Agentskills anthropic-team ~ → ├── MCP Apps: Interactive UI Extension Specification Blog mcp-community ~ → ├── MotherDuck MCP Server Motherduck motherduck-team ~ → └── MCP UI Desktop Client Github n-oxford ~ →

Claude Code Tools 10

├── Oh-My-OpenCode: Multi-Agent Plugin System Github y-code ~ → ├── 5 Fixes for Claude Skills Failures Open a-francis ~ → ├── Claude Use Cases Directory Claude anthropic-team ~ → ├── Claude Code Templates: Stack Builder Aitmpl aitmpl ~ → ├── Compound Engineering: Claude Code Plugin Github every-inc ~ → ├── Continuous Claude: Context Management System Github parcadei ~ → ├── Coding Tutor: Personalized AI Learning Plugin Github n-agarwal ~ → ├── Writing a Good CLAUDE.md Humanlayer kyle-humanlayer ~ → ├── Awesome Claude: Marketplace Directory Github g-mickel ~ → └── Claude Commands and Prompts Guide Nurijanian g-nurijanian ~ →

Agent Platforms 8

├── Simular AI: Autonomous Computer Use Agent Simular simular ~ → ├── Agentic Data Scientist Github k-dense-ai ~ → ├── Markdown Site: AI-Ready Publishing Framework Github w-sutton ~ → ├── Swarms: Enterprise Multi-Agent Orchestration Github swarms-team ~ → ├── Agent-Native Architectures: How to Build Apps After Code Ends Every d-shipper ~ → ├── ClawdBot: Claude Discord Bot Github zolastic ~ → ├── Claude Delegator: Task Routing Agent Github k-leneway ~ → └── AgentFS: Filesystem for AI Agents Github p-enberg ~ →

Documentation & Knowledge 5

├── CodeWiki: Repository-Level Documentation Framework Github fsoft-ai4code ~ → ├── AI Builder's Guide: Building Analytics Agents Motherduck motherduck-team ~ → ├── Markdown Site Generator Github domdomegg ~ → ├── Obsidian Skills: AI Agent Behaviors Github c-cielecki ~ → └── SpecStory: AI Coding Session History Specstory specstory-team ~ →

📦 SOURCE

×

Resource Title

Resource description goes here.

ai-tools implementation

👤 Author Name