davidkimai
💭
optimizing my reward function

Pinned

  1. Context-Engineering Public

    "Context engineering is the delicate art and science of filling the context window with just the right information for the next step." — Andrej Karpathy. A frontier, first-principles handbook inspi…

    Python 9k 1k

  2. acp Public

    ACP is a reusable democratic coordination primitive for discussion under load. Relay is the reference implementation used to validate that primitive in practice.

    JavaScript 1

  3. specoracle Public

    SpecOracle is a Python evaluation pipeline for testing whether informal specifications can act as in-context oracles for secure program synthesis.

    Python 3 1

  4. bioguard_aixbio Public

    Apart Research AIxBio Hackathon Submission. BioGuard is a small research prototype for screening biological AI conversations before risk is carried across turns.

    Python

  5. RL101 Public

    Agentic Reinforcement Learning 101. A pragmatic course for AI/ML Engineers based on "The Landscape of Agentic Reinforcement Learning for LLMs: A Survey" https://arxiv.org/abs/2509.02547

    Roff 22 3

  6. Langtons-Emergence Public

    Recently I have been researching emergent complexities through first-principles reductionism of Langton's Ant and related cellular automata in the hopes that they could potentially offer insights i…

    Python 16 6