A structured hierarchy of energy-efficiency principles for guiding LLM agents in automated code optimization.
About
A principled rulebook that turns any LLM coding agent into an energy-efficiency auditor.
Instead of ad-hoc prompting, the agent evaluates every refactoring against a formal hierarchy of principles spanning algorithmic complexity, memory behavior, and infrastructure utilization.
Designed for real-world codebases. The agent scans repositories, proposes principled refactorings, and measures energy impact using built-in profiling tools.
Meta-principles guarantee correctness, safety, and API stability. The agent self-critiques every proposal against a checklist before presenting it to a human reviewer.
The constitution defines what to optimize. Language and infrastructure-specific skill documents define how. Skills are fetched on demand based on project detection.
Four reproducible benchmark suites (llama.cpp, FFmpeg, Pasteur, Scenarios) with prepare, build, test, and profiling scripts for consistent evaluation.
Built-in profiler measures CPU and GPU energy via RAPL, perf, SPBM hwmon, or nvidia-smi. Optional smart plug integration for wall power measurement.
Principles
When principles conflict, the higher tier always wins. Within the same tier, prefer the refactoring with the higher measured energy impact.
Order-of-magnitude waste: N+1 queries, unbuffered I/O, O(n²) string concatenation, missing indexes, recursive event loops, unmemoized recursion, idle cloud resources, unbatched DB operations.
Significant waste: blocking I/O on hot paths, heap allocation in tight loops, resource leaks, SELECT *, uncompressed payloads, linear-search lookups, no auto-scaling, missing caching, unbounded queries, chatty microservices.
Moderate waste: unsized collections, invariant computation in loops, JSON for internal APIs, oversized pods, interpreted CPU-bound services, bloated container images, missing circuit breakers.
Minor or contextual: dead code, redundant environments, non-ephemeral CI, carbon-unaware scheduling, unoptimized client bundles.
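The tier ordering above reduces to a simple sort key. A minimal sketch, assuming the agent represents each candidate refactoring as a record with a tier (1 = order-of-magnitude, 4 = minor/contextual) and a measured energy impact; the record shape and field names are illustrative, not part of the constitution:

```python
from dataclasses import dataclass

@dataclass
class Candidate:
    name: str
    tier: int             # 1 = order-of-magnitude ... 4 = minor/contextual
    impact_joules: float  # measured energy saving

def prioritize(candidates):
    # Lower tier number wins first; within a tier,
    # higher measured energy impact comes first.
    return sorted(candidates, key=lambda c: (c.tier, -c.impact_joules))

fixes = [
    Candidate("dead code removal", 4, 0.5),
    Candidate("batch DB writes", 1, 120.0),
    Candidate("cache hot lookup", 2, 300.0),
    Candidate("fix N+1 query", 1, 95.0),
]
print([c.name for c in prioritize(fixes)])
```

Note that the tier-2 caching fix saves more joules than either tier-1 fix, yet still ranks below both: tier dominates impact.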
Architecture
The constitution provides the principles. Benchmark prompts define the goal. The agent profiles, optimizes in a loop, and commits each improvement with measured results.
Setup
Measure
Optimize
Finalize
Usage
Add the GreenCode Constitution as a skill to your LLM coding agent. The agent fetches technology-specific guidance on demand.
The generated skill.md contains the full constitution plus a skill resolution table. Give it to your agent as a system prompt, tool, or skill file.
curl -sfL https://greencode-constitution.org/skill.md -o skill.md
Depending on your framework, attach skill.md as a system prompt, a skill document, or a file the agent can read. For Claude Code, place it in your project's .claude/ directory or reference it as a custom skill.
# Example: add as a Claude Code project instruction
cp skill.md .claude/skill-greencode.md

# Or reference the URL directly in your agent config
# skill_url: https://greencode-constitution.org/skill.md
The skill document instructs the agent to scan the repository for technology markers (e.g., requirements.txt, pom.xml, Dockerfile) and fetch matching skill docs from greencode-constitution.org/docs/.
# The agent runs detection automatically, then fetches e.g.:
# https://greencode-constitution.org/docs/code/python.md
# https://greencode-constitution.org/docs/architecture/docker.md
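The detection step amounts to scanning for marker files and mapping them to skill-doc URLs. A hypothetical sketch; the marker-to-doc mapping below is illustrative, and the authoritative resolution table lives in skill.md:

```python
import os

# Illustrative marker -> doc path mapping (the real table is in skill.md)
MARKERS = {
    "requirements.txt": "docs/code/python.md",
    "pom.xml": "docs/code/java.md",
    "Dockerfile": "docs/architecture/docker.md",
}

BASE_URL = "https://greencode-constitution.org/"

def detect_skills(repo_root):
    """Return skill-doc URLs for every technology marker found in the repo."""
    found = set()
    for _dirpath, _dirnames, filenames in os.walk(repo_root):
        for marker, doc in MARKERS.items():
            if marker in filenames:
                found.add(BASE_URL + doc)
    return sorted(found)
```

An agent would fetch each returned URL and load it alongside the constitution before proposing refactorings.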
Run the built-in energy profiler to measure actual CPU and GPU energy consumption before and after optimization.
bash <(curl -sfL https://greencode-constitution.org/profile.sh) -- python my_app.py
Outputs CPU joules, GPU joules, wall time, and CPU time. Supports multiple measurement backends: RAPL, perf, SPBM hwmon, and nvidia-smi.
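For the RAPL backend, energy is exposed as a cumulative microjoule counter (`energy_uj` under `/sys/class/powercap/`) that wraps around at `max_energy_range_uj`. A minimal sketch of the wrap-safe joule computation, not the profiler's actual code:

```python
def rapl_joules(start_uj, end_uj, max_range_uj):
    """Energy in joules between two RAPL counter readings,
    tolerating at most one counter wraparound."""
    delta_uj = end_uj - start_uj
    if delta_uj < 0:  # counter wrapped past max_energy_range_uj
        delta_uj += max_range_uj
    return delta_uj / 1e6  # microjoules -> joules
```

A real profiler reads the counter before and after the workload; if the run is long enough for more than one wrap, it must sample periodically instead.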
Languages
Infrastructure
Guides
Benchmarks
Reproducible benchmarks for measuring energy improvements. Each suite includes prepare, build, test, and profiling scripts.
Qwen3-8B quantized model. Measures prompt processing and text generation throughput with GPU energy profiling via nvidia-smi and Nsight.
1080p CPU and 4K GPU transcoding pipelines with complex filter graphs. Auto-detects NVENC for GPU-accelerated encoding.
Kedro-based data synthesis pipeline with the mimic_core dataset. CPU-intensive workload exercising Python data processing.
~45 scenarios from LeetCode, CLBG, and micro-benchmarks across C, C++, Java, Python, Ruby, and Rust.
Results
Evaluated on open-source projects. All measurements via SPBM hardware energy accumulators on DGX Spark (GB10), except llama.cpp kernel-only (nvidia-smi).
Every proposed refactoring passes an 8-point checklist before the agent presents it to a human reviewer: profiling verification, correctness, safety, scope, principle citation, impact estimation, trade-off assessment, and test compatibility. A 4-point post-proposal review then checks the proposal as if reviewing a pull request. If any check fails, the proposal is revised or discarded.
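The gating logic is an all-or-nothing filter over the checklist. A hypothetical sketch; the check names paraphrase the 8-point checklist above, and the boolean-flag proposal shape is a stand-in for whatever the agent actually records:

```python
PRE_CHECKS = [
    "profiling verification", "correctness", "safety", "scope",
    "principle citation", "impact estimation", "trade-off assessment",
    "test compatibility",
]

def gate(proposal, checks=PRE_CHECKS):
    """Return True only if the proposal passes every check;
    failing proposals are revised or discarded, never presented."""
    return all(proposal.get(check, False) for check in checks)
```

A passing proposal then goes through the 4-point post-proposal review; any failure at either stage sends it back for revision.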