Knowledge Without Junk

The infrastructure layer that makes every Claude token count.

Self-improving daily — 48 new features every 24h

Stop burning tokens on overhead.

Claude Pro costs $20/month. KWJ cuts your token usage by 90% — so Pro does the work of Max. 50 tools, one API key, no per-call charges.

No credit card. 1-hour free trial. $19/month after.

🔒 80% savings guarantee 🛡 End-to-end encrypted 📄 No CC required ⚡ Keys issued instantly
90.3% token reduction  ·  measured across 47 live sessions
2.62M → 254K tokens per session  ·  STAR50 benchmark, reproducible
50 tools deployed  ·  48 new features every 24h
$0 cost per cache hit  ·  unlimited on $19 plan

* Measured on the fleet-evolve STAR50 benchmark. Individual results vary by workflow.
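
The per-session token counts quoted above imply a reduction of roughly 90%. A minimal sketch of that arithmetic follows; `reduction_pct` is an illustrative helper name, not part of any KWJ SDK.

```python
def reduction_pct(before_tokens: int, after_tokens: int) -> float:
    """Percentage of tokens removed, rounded to one decimal place."""
    if before_tokens <= 0:
        raise ValueError("before_tokens must be positive")
    return round(100 * (1 - after_tokens / before_tokens), 1)


# Figures from the benchmark line above: 2.62M -> 254K tokens per session.
print(reduction_pct(2_620_000, 254_000))
```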

kwj — agent integration
Python  ·  TypeScript  ·  curl

Python

import kwj

client = kwj.Client(api_key="kwj_your_key")

# Web cache: zero tokens on repeat fetches (TTL 3600s)
docs = client.web_read("https://docs.anthropic.com/en/api")

# Output compressor: 500 lines -> 30 essential lines
compressed = client.digest(raw_build_log)

# Fuzzy answer cache: skip analysis if already computed
cached = client.cache_get("uniswap oracle audit findings")
if cached.hit:
    result = cached.value
else:
    result = run_expensive_analysis()
    client.cache_put("uniswap oracle audit findings", result)

# Code slicer: 10,000-line file -> one function (98% token cut)
fn_body = client.slice("src/main.rs", symbol="handle_request")

TypeScript

import { KWJClient } from '@kwj/sdk';

const client = new KWJClient({ apiKey: 'kwj_your_key' });

// Web cache: zero tokens on repeat fetches (TTL 3600s)
const docs = await client.webRead('https://docs.anthropic.com/en/api');

// Output compressor: 500 lines -> 30 essential lines
const compressed = await client.digest(rawBuildLog);

// Fuzzy answer cache: skip analysis if already computed
const cached = await client.cacheGet('security audit ERC-20');
let result = cached.value;
if (!cached.hit) {
  result = await runExpensiveAnalysis();
  await client.cachePut('security audit ERC-20', result);
}

// Code slicer: one symbol from a 10k-line file
const fnBody = await client.slice('src/index.ts', 'handleRequest');

curl

# Web cache - zero tokens on repeat fetches
# (-G sends the URL-encoded fields as a GET query string)
curl -G "https://kwj.ai/api/v1/web_read" \
  --data-urlencode "url=https://docs.anthropic.com" \
  --data-urlencode "api_key=kwj_xxx"

# Output compressor - 500 lines -> 30 essential lines
# (input@build.log reads the file and URL-encodes its contents)
curl -G "https://kwj.ai/api/v1/digest" \
  --data-urlencode "input@build.log" \
  --data-urlencode "api_key=kwj_xxx"

# Fuzzy answer cache lookup (Jaccard 0.35 threshold)
curl "https://kwj.ai/api/v1/cache_get?q=security+audit+uniswap&api_key=kwj_xxx"
# {"ok":true,"hit":true,"value":"...cached analysis..."}

# Document extraction - PDF without full-load token waste
curl "https://kwj.ai/api/v1/doc_extract?file=/tmp/report.pdf&api_key=kwj_xxx"

Install

pip:  pip install kwj
npm:  npm install @kwj/sdk
env:  export KWJ_API_KEY=kwj_your_key

Self-improving every hour.

The fleet runs an autonomous build loop. 48 new token-saving features ship every 24 hours. You get them automatically.

fleet-evolve — latest deployments
06-20 14:00 NEW custom-bash: TTL-keyed shell cache — eliminates 300+ repeated git/cargo calls per session (98.7% hit rate)
06-20 13:00 DEEPEN custom-digest: incremental tail mode — only digests new log lines since last poll, O(1) cost on long builds
06-20 12:00 INTEGRATE custom-mcp: fleet_plan + cache_get wired together — tool routing is now cache-first by default
06-20 11:00 DEEPEN custom-read: symbol-slice now supports Go + Python dataclasses, 4 new language parsers added
06-20 10:00 NEW custom-recall: TF-IDF auto-reindex on file mtime change — no manual index step ever needed
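
The "incremental tail mode" entry above describes digesting only the log lines added since the last poll. One common way to do this is to remember a byte offset between polls. The sketch below assumes that approach; `LogTail` and its methods are hypothetical names, not the custom-digest implementation.

```python
import os
import tempfile


class LogTail:
    """Remembers a byte offset so each poll reads only newly appended data."""

    def __init__(self, path: str):
        self.path = path
        self.offset = 0

    def poll(self) -> list:
        size = os.path.getsize(self.path)
        if size < self.offset:
            # File was truncated or rotated: start again from the top.
            self.offset = 0
        with open(self.path, "rb") as f:
            f.seek(self.offset)
            data = f.read()
        # Hold back a trailing partial line until its newline arrives.
        end = data.rfind(b"\n") + 1
        self.offset += end
        return data[:end].decode("utf-8", errors="replace").splitlines()


# Usage: the second poll returns only what was appended after the first.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "build.log")
    with open(path, "w") as f:
        f.write("step 1 ok\nstep 2 ok\n")
    tail = LogTail(path)
    first = tail.poll()
    with open(path, "a") as f:
        f.write("step 3 ok\npartial")
    second = tail.poll()
```

Each poll costs time proportional to the newly appended bytes, not to the total log size.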

WEEK OF JUN 16

custom-bash: TTL-keyed shell cache, 98.7% hit rate on repeated git/cargo calls  ·  −98.7%
custom-read: symbol-slice Go + Python dataclass support, 4 new language parsers  ·  −97%
custom-mcp: fleet_plan + cache_get wired together, routing now cache-first by default  ·  −60%
custom-digest: incremental tail mode, O(1) cost on growing logs  ·  −94%
custom-recall: auto-reindex on mtime change, no manual index step  ·  −95%

WEEK OF JUN 9

custom-git: cached git wrapper, eliminates 300+ repeated git calls per session  ·  −99%
custom-queue: durable SQLite task queue with DAG + retry/backoff  ·  new
custom-cron: self-healing cron scheduler with Telegram alerts on failure  ·  new
custom-doc: PDF/CSV/XLSX extraction with table/search/convert  ·  new
custom-audit: slither/echidna/foundry/cargo-audit unified findings  ·  new

50 tools deployed
360+ features this cycle
90.3% avg token reduction
99.9% API uptime

Why pay $200/mo for Max when $39 does more?

Claude Max charges you for tokens you would not have spent if the waste were intercepted first. KWJ intercepts the waste.

Feature | Claude Max alone ($200/mo) | KWJ + Claude Pro ($39/mo)
Token capacity | 5x higher limit | Pro limit + 90% savings = same effective capacity
Infrastructure tools | None | 50 tools: cache, compress, slice, recall
Self-improving | No | Yes, 48 new features deployed per day
Cache hit cost | Still burns tokens | $0: answer served from cache, zero Claude calls
Monthly cost | $200/month | $19 + $20 = $39/month
Monthly savings | $161/month saved. Better tools. Same output.
API usage tracking | No per-call visibility | Per-endpoint usage dashboard included
Tool updates | No self-improvement | Auto-delivered, 48 tool updates every 24h

80% savings guarantee: if KWJ does not reduce your Claude token usage by at least 80% in your first 30 days, your first month is refunded. No questions asked.

Every number is a real session measurement.

No synthetic benchmarks. These are measured reductions from live Claude sessions.

Command Cache: shell call deduplication  ·  98.7%

The same git status runs 300+ times per session without caching. One TTL-keyed hash returns the result instantly on every subsequent call.

300 calls → 1 real call  ·  cache hit rate: 98.7%
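
A TTL-keyed command cache of the kind described above can be sketched in a few lines. This is an illustration under assumptions, not the custom-bash implementation: the `CommandCache` class, the choice of SHA-256 over the working directory plus argv as the key, and the 60-second default TTL are all hypothetical.

```python
import hashlib
import subprocess
import sys
import time


class CommandCache:
    """In-memory shell cache keyed by a hash of (cwd, argv), with a TTL."""

    def __init__(self, ttl_seconds: float = 60.0, clock=time.monotonic):
        self.ttl = ttl_seconds
        self.clock = clock          # injectable so expiry can be tested
        self.store = {}             # key -> (expires_at, stdout)
        self.real_calls = 0

    def _key(self, argv, cwd):
        raw = "\0".join([cwd or "", *argv]).encode()
        return hashlib.sha256(raw).hexdigest()

    def run(self, argv, cwd=None) -> str:
        key = self._key(argv, cwd)
        now = self.clock()
        entry = self.store.get(key)
        if entry is not None and entry[0] > now:
            return entry[1]         # cache hit: no process is spawned
        self.real_calls += 1
        result = subprocess.run(
            argv, cwd=cwd, capture_output=True, text=True, check=True
        )
        self.store[key] = (now + self.ttl, result.stdout)
        return result.stdout


# Usage: 300 identical calls spawn a single real process.
cache = CommandCache(ttl_seconds=60)
command = [sys.executable, "-c", "print('clean')"]
for _ in range(300):
    output = cache.run(command)
print(cache.real_calls, output.strip())
```

A real cache for commands like git status also needs invalidation when the working tree changes. A TTL alone only bounds how stale a result can be.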

Smart Reader: symbol slicing  ·  98%

Claude reads entire 2,000-line files when it needs one function. Symbol-slicing sends only the relevant span — 40 lines instead of 2,000.

2,000 lines → 40 lines per read  ·  98% token cut
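
For Python sources, the idea behind symbol slicing can be shown with the standard library's `ast` module. This is a stand-in that handles Python only; how KWJ parses Rust, Go, or TypeScript is not described here, and `slice_symbol` is a hypothetical helper name.

```python
import ast


def slice_symbol(source: str, symbol: str) -> str:
    """Return only the source lines of the named function or class."""
    wanted = (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)
    lines = source.splitlines()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, wanted) and node.name == symbol:
            # lineno is 1-based and end_lineno is inclusive (Python 3.8+).
            start = node.lineno
            if node.decorator_list:
                start = min(d.lineno for d in node.decorator_list)
            return "\n".join(lines[start - 1:node.end_lineno])
    raise KeyError(symbol)


source = (
    "import os\n"
    "\n"
    "def helper():\n"
    "    return 1\n"
    "\n"
    "def handle_request(req):\n"
    "    body = req.upper()\n"
    "    return body\n"
    "\n"
    "X = 2\n"
)
print(slice_symbol(source, "handle_request"))
```

Only the three lines of `handle_request` are returned; the rest of the file never enters the prompt.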

Output Compressor: log noise removal  ·  94%

Build logs bloat context. The compressor collapses repeated lines, elides middles, and always rescues error/warning lines. Errors are never lost.

500-line log → 30 lines  ·  errors/warnings always preserved
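
The three behaviours named above (collapse repeats, elide the middle, keep errors and warnings) can be sketched as follows. The function name, the head and tail sizes, and the keyword pattern are assumptions for illustration, not the custom-digest rules.

```python
import re

# Lines matching this pattern are never elided (assumed keyword list).
KEEP = re.compile(r"\b(error|warning)\b", re.IGNORECASE)


def compress_log(text: str, head: int = 10, tail: int = 10) -> str:
    # Pass 1: collapse runs of identical consecutive lines.
    runs = []
    for line in text.splitlines():
        if runs and runs[-1][0] == line:
            runs[-1][1] += 1
        else:
            runs.append([line, 1])
    lines = [l if n == 1 else f"{l}  [x{n}]" for l, n in runs]
    if len(lines) <= head + tail:
        return "\n".join(lines)

    # Pass 2: keep the head and tail; in the middle keep only KEEP matches.
    out = lines[:head]
    skipped = 0
    for line in lines[head:len(lines) - tail]:
        if KEEP.search(line):
            if skipped:
                out.append(f"... {skipped} lines elided ...")
                skipped = 0
            out.append(line)
        else:
            skipped += 1
    if skipped:
        out.append(f"... {skipped} lines elided ...")
    out.extend(lines[len(lines) - tail:])
    return "\n".join(out)
```

With the default settings, a 500-line log containing one error line in the middle comes back as the first ten lines, an elision marker, the error, a second marker, and the last ten lines.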

Memory Search: TF-IDF fact retrieval  ·  95%

Full memory files load on every turn. TF-IDF indexing pulls only the 3–5 relevant fact chunks instead of loading the entire knowledge base.

TF-IDF  ·  per-file mtime indexing  ·  auto-reindex on change
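
A small pure-Python sketch of TF-IDF chunk retrieval is shown below. The tokenizer, the smoothed IDF formula, and the `top_chunks` name are assumptions chosen for illustration; the weighting used by custom-recall is not specified on this page.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list:
    return re.findall(r"[a-z0-9]+", text.lower())


def top_chunks(chunks: list, query: str, k: int = 3) -> list:
    """Rank chunks by TF-IDF weight of the query terms; return the best k."""
    docs = [Counter(tokenize(c)) for c in chunks]
    n = len(docs)
    # Document frequency: in how many chunks does each term appear?
    df = Counter(term for d in docs for term in d)

    def idf(term: str) -> float:
        return math.log((1 + n) / (1 + df[term])) + 1  # smoothed IDF

    query_terms = set(tokenize(query))
    scored = []
    for i, d in enumerate(docs):
        length = sum(d.values()) or 1
        score = sum((d[t] / length) * idf(t) for t in query_terms)
        scored.append((score, i))
    scored.sort(key=lambda s: (-s[0], s[1]))
    return [chunks[i] for score, i in scored[:k] if score > 0]


facts = [
    "The API gateway is written in Rust and uses tokio.",
    "Lunch menu: pasta on Tuesdays.",
    "Database migrations run through sqlx at deploy time.",
    "Rust services share one tokio runtime per process.",
]
print(top_chunks(facts, "rust tokio runtime", k=2))
```

Only the chunks that share terms with the query are returned, so unrelated facts stay out of the prompt.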

Answer Cache: fuzzy analysis reuse  ·  100% on hit

Expensive analysis gets re-derived from scratch. Jaccard shingle similarity at 0.35 threshold matches near-identical queries and returns the stored answer immediately.

Jaccard 0.35 threshold  ·  sha256 content-address  ·  $0 on hit
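
The matching rule above can be illustrated with word shingles and a Jaccard score. The 0.35 threshold and the SHA-256 content address come from the text; the shingle size of two words and the `AnswerCache` class are assumptions made for this sketch.

```python
import hashlib


def shingles(text: str, size: int = 2) -> set:
    """Set of word n-grams; size=2 is an assumed default."""
    words = text.lower().split()
    if len(words) < size:
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def jaccard(a: set, b: set) -> float:
    union = a | b
    return len(a & b) / len(union) if union else 0.0


class AnswerCache:
    def __init__(self, threshold: float = 0.35):
        self.threshold = threshold
        self.entries = []   # (query shingles, content address)
        self.blobs = {}     # sha256 hex digest -> stored answer

    def put(self, query: str, answer: str) -> str:
        address = hashlib.sha256(answer.encode()).hexdigest()
        self.blobs[address] = answer
        self.entries.append((shingles(query), address))
        return address

    def get(self, query: str):
        q = shingles(query)
        best_score, best_address = 0.0, None
        for stored, address in self.entries:
            score = jaccard(q, stored)
            if score > best_score:
                best_score, best_address = score, address
        if best_address is not None and best_score >= self.threshold:
            return self.blobs[best_address]
        return None


answers = AnswerCache()
answers.put("uniswap oracle audit findings", "TWAP window too short")
print(answers.get("uniswap oracle audit findings summary"))
```

In this example the two queries share three of four shingles, a score of 0.75, so the stored answer is returned. A threshold as low as 0.35 trades precision for hit rate, so loosely related queries may also match.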

50 tools. One API key.

Every tool targets a measured token sink. The fleet compounds: each tool makes every other tool cheaper to call.

💾
CACHE

Command Cache

Runs shell commands once, reuses the result for hours. Eliminates 300+ repeated identical calls per session via TTL-keyed hash.

SLICE

Smart Reader + Code Slicer

Reads only the function you need. Extracts one symbol from a 10,000-line file. Rust, Python, TypeScript, Go, JavaScript.

🌐
CACHE

Web Cache

Fetches web pages once, then serves them from cache until the TTL expires (3600s in the examples on this page). TTL-keyed by URL hash. Eliminates redundant documentation fetches.

📊
COMPRESS

Output Compressor

Shrinks 500-line logs to 30 essential lines. Collapses noise, always rescues error and warning lines. Failures are never dropped.

🧠
RECALL

Memory Search

Finds relevant facts without loading all memory. TF-IDF pulls only the 3–5 fact chunks you need. Auto-reindexes on file change.

📦
CACHE

Answer Cache

Never recomputes the same analysis twice. Fuzzy Jaccard shingle similarity at 0.35 threshold catches near-identical queries.

📄
PARSE

Document Parser

PDF, CSV, and XLSX extraction without token waste. 3-tier PDF pipeline: native text, OCR, Rust fallback. Structured output.

🔗
BRIDGE

MCP Bridge

One call to 25+ tools. No repeated tool loading. Web read, vault, cache, recall — all one hop away via the MCP protocol.

+42 MORE

Browser, proxy, email, TOTP, compute, storage, vault…

Git cache, job scheduler, task queue, code auditor, search engine. All included at $19/month.

Every agent type covered.

Security Auditor

Audit a 40k-line codebase without melting context

Smart Reader slices only the function under review. Output Compressor strips test noise from Slither/Echidna output. Answer Cache reuses prior invariant analysis.

98% token cut on symbol reads  ·  tools: custom-read, custom-digest, custom-cache

Research Agent

Fetch, index, and recall without re-downloading

Web Cache serves repeated URL fetches at $0. Memory Search pulls only the 3–5 relevant fact chunks from a large knowledge base using TF-IDF. Answer Cache short-circuits repeated analysis.

95% memory recall cut  ·  tools: custom-websearch, custom-recall, custom-cache

Coding Assistant

Navigate large repos without reading whole files

Context Map gives a symbol outline of any directory. Symbol Slicer sends only the function you asked about. Command Cache deduplicates git status, git log, and build commands.

300 git calls → 1 real call  ·  tools: custom-bash, custom-context, custom-read

Finance Data Agent

Pull market data once, use it all session long

Web Cache TTLs price feeds to avoid redundant fetches. Output Compressor trims verbose API responses. Vault stores API credentials with AES-256-GCM so secrets never hit the prompt.

$0 per cache hit  ·  tools: custom-websearch, custom-digest, custom-vault

Calculate your token reduction.


Based on 90% average token reduction applied to your plan cost.
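
The estimate described above is a single multiplication. A minimal sketch, assuming the input is a monthly plan cost in dollars (the page does not say what the calculator takes as input); `estimated_monthly_savings` is an illustrative name.

```python
def estimated_monthly_savings(plan_cost_usd: float, reduction: float = 0.90) -> float:
    """Plan cost multiplied by the average token reduction."""
    if not 0.0 <= reduction <= 1.0:
        raise ValueError("reduction must be between 0 and 1")
    return round(plan_cost_usd * reduction, 2)


print(estimated_monthly_savings(200))   # Max-priced plan
print(estimated_monthly_savings(20))    # Pro-priced plan
```

The sketch does not subtract the $19 KWJ subscription; whether the page's calculator does is not stated.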

Three steps. No new code.

01

Get your free API key

Enter your email below. Key issued instantly. No credit card required. 1-hour trial, then $19/month.

02

Add one environment variable

Set KWJ_API_KEY. Tools register as MCP endpoints in your Claude session automatically.

03

Watch the savings

Track token reduction live. Most users see measurable cuts in the first session. The fleet compounds with use.

FAQ answered simply.

What is a token and why does it matter?

A token is roughly 4 characters of text — a word, a symbol, a fragment of code. Every API call to Claude is priced per token in and per token out. A 500-line log file is about 8,000 tokens. If Claude reads it once per turn and you have 20 turns in a session, that one file costs 160,000 tokens — before you've done any real work. KWJ intercepts that before it reaches Claude.
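
The arithmetic in this answer can be checked directly. The sketch below uses the 4-characters-per-token rule of thumb from the answer; real tokenizers vary, and both function names are illustrative.

```python
import math

CHARS_PER_TOKEN = 4  # rough rule of thumb, not an exact tokenizer


def estimate_tokens(text: str) -> int:
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def session_tokens(tokens_per_read: int, turns: int) -> int:
    """Tokens spent when the same content is re-read on every turn."""
    return tokens_per_read * turns


# A 500-line log averaging 64 characters per line is about 8,000 tokens.
log_text = "x" * (500 * 64)
per_read = estimate_tokens(log_text)
print(per_read, session_tokens(per_read, turns=20))
```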

How does the free trial work?

Enter your email and you get an API key immediately — no credit card needed. The key is valid for 1 hour and includes 100 API calls across all 50 tools. That's enough to run a real session and see the token savings first-hand. After the trial expires, you choose whether to subscribe at $19/month.

What happens after the 1-hour trial ends?

Your trial key stops accepting new requests. Your Claude session continues normally — KWJ tools simply return an auth error and Claude falls back to its default behavior. None of your work is lost. Subscribe at any point to reactivate the same key or get a new one.

Can I use my existing Claude account?

Yes. KWJ works with any Claude account — Claude.ai, Claude Pro, Claude Max, or the API directly. You add KWJ_API_KEY as an environment variable and the 50 tools register as MCP endpoints in your Claude session. No changes to your Claude plan are required.

Works with your existing stack.

Drop KWJ into any Claude workflow. No framework lock-in. Works with LangChain, the raw Anthropic SDK, or bare HTTP.

kwj — end-to-end agent integration
Python + Anthropic SDK  ·  MCP Config (Claude Code)

Python + Anthropic SDK

import anthropic
import kwj

# One client for Claude, one for KWJ
claude = anthropic.Anthropic(api_key="sk-ant-your-key")
kwj_client = kwj.Client(api_key="kwj_your_key")

def run_agent_turn(user_message: str) -> str:
    # 1. Check the answer cache first: a hit skips Claude and all context prep
    cache_hit = kwj_client.cache_get(user_message)
    if cache_hit.hit:
        return cache_hit.value

    # 2. Shrink context before sending to Claude
    #    (read_build_logs is your own helper that returns raw log text)
    compressed_logs = kwj_client.digest(read_build_logs())
    relevant_facts  = kwj_client.recall("project architecture decisions")
    code_symbol     = kwj_client.slice("src/main.rs", symbol="handle_request")

    # 3. Call Claude with a lean, pre-shrunk context
    response = claude.messages.create(
        model="claude-sonnet-4-5",
        max_tokens=2048,
        messages=[
            {"role": "user", "content": (
                f"Context:\n{relevant_facts}\n\n"
                f"Recent build output:\n{compressed_logs}\n\n"
                f"Relevant code:\n{code_symbol}\n\n"
                f"Question: {user_message}"
            )}
        ]
    )
    result = response.content[0].text

    # 4. Store in cache for future turns
    kwj_client.cache_put(user_message, result)
    return result

MCP Config (Claude Code)

# .mcp.json in the project root: registers KWJ as an MCP server in Claude Code
{
  "mcpServers": {
    "kwj": {
      "command": "npx",
      "args": ["-y", "@kwj/mcp-server"],
      "env": {
        "KWJ_API_KEY": "kwj_your_key_here"
      }
    }
  }
}

# That's it. Claude Code now has access to all 50 KWJ tools:
#   kwj_web_read      — cached web fetch (TTL 3600s)
#   kwj_digest        — shrink log/command output
#   kwj_slice         — extract one symbol from a file
#   kwj_cache_get     — fuzzy answer cache lookup
#   kwj_cache_put     — store result for future turns
#   kwj_recall        — TF-IDF memory search
#   kwj_doc_extract   — PDF / XLSX / CSV extraction
#   kwj_git           — cached git wrapper
#   ... and 42 more

# Verify the connection:
$ kwj ping --api-key kwj_your_key_here
# {"ok":true,"tools":50,"plan":"trial","expires_in":"58m"}

Simple. Flat. No surprises.

One plan. Everything included. No per-seat, no per-call, no overages.

Monthly

$19

per month  ·  1-hour free trial

  • All 50 infrastructure tools
  • Unlimited cache hits ($0 each)
  • Command, web and git cache
  • Output compressor and code slicer
  • Document extraction (PDF, XLSX, CSV)
  • MCP bridge — 25+ tools, one call
Or start a free 1-hour trial
🛡
80% savings or your money back.
If token usage doesn't drop by at least 80% in your first 30 days, your first month is refunded — no forms, no questions, no friction.

Get your free API key.

1-hour free trial. No credit card. 100 API calls included. Your key arrives instantly.


No credit card required. One trial per email. Disposable addresses blocked. By submitting you agree we may contact you about your trial and updates.

KWJ Tool Catalog