JamJet benchmarks
Reproducible measurements with full methodology and code. Each deep-dive includes the reproduction script, the raw numbers, and the competitive context where it matters.
Deep-dives
Memory · LongMemEval-S
Engram v0.1.0 on the LongMemEval-S leaderboard
Engram alongside Chronos, Mastra, OMEGA, and Zep — judged on the same methodology, compared on the axes that matter beyond raw accuracy.
Read → Runtime · Framework taxJamJet adds zero observable overhead vs raw LLM calls
Reproducible llama3.2 + qwen3:8b measurements with the same OpenAI-compatible client. JamJet vs raw vs LangGraph, three way, statistically tied.
Read →See also: JamJet comparisons — how JamJet fits with LangGraph, CrewAI, Temporal, Microsoft AGT, and the rest of your stack.