personal-ai-feed
AI-ranked personal intelligence dashboard. debug
Just my attempt to run benchmarks for performance in the browser and WASM, fully client-side. Check it out, and let’s see what else we can do with client-side WASM benchmarks. Feel free to contribute benchmarks you like, only condition is WASM and client-side. https://userfrom1995.github.io/benchd/
Hey HN, We're Gus and Carlos from Runtime ( https://runtm.com ). We're building infra that lets your whole team (including non-engineers) ship with Claude Code, Codex, and other agents without engineering having to handhold every session. After Mentum (YC S21) was acquired, I personally shipped 4 full-stack products in 3 months using coding agents. When I tried to roll the same workflow out to the rest of the team, it fell apart: Most PRs were unmergeable slop - Every repo re
I built this because my coding agent kept telling me he did complete the task, but when I verified it, it was not the case. I made Proof Loop fairly light, intentionally. It’s basically a protocol helper script for AI agent tasks: - set acceptance criteria before coding/implementation - keep the builder and verifier roles separate - each criteria tested with results PASS, FAIL or UNKNOWN - attach evidence of done - keep the proof evidence in the repo, so that the next agent / run can i
Hi HN! Outside of the hackernews bubble we often find engineers who are barely using AI (aka using microsoft copilot) and we needed an easy way to show the latest capabilities in a non confusing UI. So we dumbed down our product to a simple text box UI where you one-shot your feature and you get an email with a link to a PR in github. The backend is hosted in Cloudflare, spinning sandboxes in Daytona that run the Opencode harness. Feel free to give it a try or share it with people who are skepti