davidkimai

💭

optimizing my reward function

Stay hungry. Stay foolish. - Steve Jobs. Building Kubernetes for Agents

557 followers · 260 following

Pinned

Context-Engineering Public

"Context engineering is the delicate art and science of filling the context window with just the right information for the next step." — Andrej Karpathy. A frontier, first-principles handbook inspi…

Python 9k 1k
acp Public

ACP is a reusable democratic coordination primitive for discussion under load. Relay is the reference implementation used to validate that primitive in practice.

JavaScript 1
specoracle Public

SpecOracle is a Python evaluation pipeline for testing whether informal specifications can act as in-context oracles for secure program synthesis.

Python 3 1
bioguard_aixbio Public

Apart Research AIxBio Hackathon Submission. BioGuard is a small research prototype for screening biological AI conversations before risk is carried across turns.

Python
RL101 Public

Agentic Reinforcement Learning 101. A pragmatic course for AI/ML Engineers based on "The Landscape of Agentic Reinforcement Learning for LLMs: A Survey" https://arxiv.org/abs/2509.02547

Roff 22 3
Langtons-Emergence Public

Recently I have been researching emergent complexities through first principles reductionism of Langton's Ant and related cellular automata in the hopes that they could potentially offer insights i…

Python 16 6