Skip to content

ReputAgent/repkit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RepKit — A Reputation SDK for AI Agents

Status: Work in ProgressStar this repo to get notified when we ship.

RepKit turns every agent interaction into an evaluation event. When Agent A delegates to Agent B, Agent A observes the outcome. That observation becomes data. Accumulated data becomes reputation.

Because a benchmark is a snapshot — reputation is a trajectory.

Full product overview at reputagent.com/repkit

The Problem

Over 40% of agentic AI projects will be canceled by 2027 (Gartner). Teams can't answer a simple question: "Can I trust this agent?"

Benchmarks measure capability at one moment. They don't tell you if an agent is consistent, how it handles edge cases, or whether it's improving over time.

RepKit makes continuous evaluation operational infrastructure — not a gate before deployment, but a system that runs during production.

How It Works

Interaction → Evaluation → Accumulation → Reputation
  1. Interaction — Agent A delegates a task to Agent B
  2. Evaluation — Agent A observes the outcome and logs it via RepKit
  3. Accumulation — Evaluations aggregate across interactions and time
  4. ReputationTrust signals power routing, access, and governance decisions

API Preview

from repkit import RepKit

rk = RepKit(api_key="rk_...")

# Log an evaluation from an agent-to-agent interaction
rk.log_interaction_evaluation(
    interaction_id="txn-789",
    agent="agent-123",
    dimensions={
        "accuracy": 0.95,
        "safety": 0.88,
        "helpfulness": 0.93
    }
)

# Query reputation — accumulated from all evaluations
rep = rk.get_reputation("agent-123")
print(rep.score)        # 7.8
print(rep.trend)        # "improving"
print(rep.eval_count)   # 142
import { RepKit } from "@reputagent/repkit";

const rk = new RepKit({ apiKey: "rk_..." });

await rk.logEvaluation({
  interactionId: "txn-789",
  agent: "agent-123",
  dimensions: { accuracy: 0.95, safety: 0.88, helpfulness: 0.93 },
});

const rep = await rk.getReputation("agent-123");

What Reputation Powers

Use Case How Reputation Helps
Routing Which agent gets this task? Route based on track record.
Access control What capabilities unlock? Permissions earned through reliability.
Delegation Should A trust B's output? Historical evidence decides.
Governance What oversight level? Tiered autonomy based on trust signals.

Design Principles

  • Evidence over assertions — RepKit aggregates structured evaluation inputs over time, not single-run judgments
  • Reputation over scores — Signals accumulate across interactions and versions, producing durable reputation
  • Signals, not decisions — RepKit computes reputation signals; enforcement remains under your control

What RepKit Does Not Do

RepKit records evaluations, computes reputation, and exposes results via API. It does not:

  • Mandate a specific judge model or evaluator
  • Require a routing framework or agent runtime
  • Enforce decisions — you remain in control

Built on Documented Patterns

RepKit implements concepts from the ReputAgent evaluation patterns library:

Avoids documented failure modes:

Related

Get Early Access

RepKit is in development. Request early access at reputagent.com/repkit.

License

Apache-2.0 — see LICENSE.


Patent pending. RepKit represents one embodiment of the claimed inventions. Descriptions here are illustrative and do not limit the scope of current or future claims.

About

Reputation SDK for AI agents. Log evaluations, compute reputation, expose trust signals.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors