Friday, June 26, 2026

Show HN: Overfitted a 900KB Transformer to Compress a 100MB CSV into 7MB https://ift.tt/7vQywAh

Show HN: Overfitted a 900KB Transformer to Compress a 100MB CSV into 7MB I built an experiment that uses an overfitted transformer and arithmetic coding to compress individual files. Instead of training the model to generalize, I train a 900KB transformer to memorize a single file and predict the next byte. Those predictions are fed into an arithmetic coder to produce the compressed output. On a 100MB NYC taxi CSV, it compresses to about 7MB (~0.5 bits/byte). On a 100MB slice of enwik9, it compresses to about 21MB (~1.68 bits/byte). It's pretty slow right now (roughly 20–30 minutes of training and 45 minutes each for compression and decompression on my AMD 7800XT). Checkout the repo - https://ift.tt/n0Oie9j June 23, 2026 at 05:11PM

Show HN: TBD, a Mac-native CLI-forward coding agent multiplexer https://ift.tt/UDHNTIW

Show HN: TBD, a Mac-native CLI-forward coding agent multiplexer Inspired by Conductor, dmux, claude-squad, agent-deck, and Git Tower ## What makes it different: (Aside from GUI) A core tenet is -- everything a user can do manually, must be exposed via CLI for agents/automation Best paired with something that lets agents in different worktrees talk to each other (e.g. https://ift.tt/V524ISY ) ## Background: I used and loved Conductor for months starting around January, but hit some persistent issues that made me realize that a core tool that I'm actively using for most of my waking hours sits too close to my skin to produce itches that I can't scratch myself After realizing I needed to switch to something hackable, I went through a few week-ish long trials of dmux, claude-squad, and agent-deck. They were all great, but I then realized I really didn't want to memorize keyboard shortcuts, and I've managed to put off learning how to drive tmux for over a decade, didn't want to end that streak XD So TBD happened in March. In the months since, it's gotten stable enough to the point where a few former and current colleagues have switched to using it as their daily drivers as well. It's been kind of like a fun little club house we contribute to The architecture is a daemon that handles the bulk of state management and actual work, and CLI and GUI clients as two interfaces. Users go through GUI, LLMs and scripts go through CLI. It works best for Claude Code (our shared daily drivers) but two of us also use Codex on the side, so there's some basic support there as well The only way to run it is to clone and build from source, partially b/c I imagine the main appeal is for people who need to hack on the thing they're using (but also b/c didn't want to shell out for an Apple dev license) I think it's now a good enough starting point for similarly minded folks to use as a base to fork and build your own variants, tailored to your own workflows https://ift.tt/zbvWaKu June 26, 2026 at 08:59PM

Thursday, June 25, 2026

Show HN:Every Team Is Building the Same Cache https://ift.tt/hxfpUja

Show HN:Every Team Is Building the Same Cache https://ift.tt/ODibL2K June 26, 2026 at 01:40AM

Show HN: No chair fixed my back, so we built one that won't let you sit still https://ift.tt/fOdHMpD

Show HN: No chair fixed my back, so we built one that won't let you sit still https://ift.tt/6qvzsyU June 25, 2026 at 11:06PM

Show HN: OpenKnowledge – open source AI-first alternative to Obsidian/Notion https://ift.tt/euG9ZQp

Show HN: OpenKnowledge – open source AI-first alternative to Obsidian/Notion Hi HN, Nick here. We’re launching OpenKnowledge ( https://ift.tt/zGirO4o ), a “what you see is what you get” markdown editor that has direct integrations with Claude, Codex, and Cursor. Available as MacOS app or CLI. Fully free/local and OSS ( https://ift.tt/PIZ5RVW ). We built this because we wanted a “Google docs” like experience for writing and sharing markdown files across our team. Obsidian is the best alternative we tried, but found it doesn’t have a true “what you see is what you get” UI and it didn’t integrate well with Claude/Codex outside of community plugins. So we built OpenKnowledge. It takes shape as: 1. A MacOS app with a file navigator, the WYSIWYG editor, and link explorer. 2. Integrations with the Claude, Codex, and Cursor desktop apps. The agents can open an OpenKnowledge editor within their embedded web browsers for a side-by-side experience. 3. Built-in mcps, skills, and RAG for LLM-wiki and “AI Second Brain” scenarios + spec writing 4. An embedded terminal and CLI for TUI-first users OSS stack includes: Tiptap/prosemirror, CodeMirror, yjs (CRDT), Electron (MacOS app), Orama, remark/rehype/micromark/mdast, @pierre/trees On the architecture side, the interesting eng. challenges included: 1. A pipeline to convert ProseMirror to markdown in a bidirectional lossless way. ProseMirror uses ASTs, which are not designed to have byte-fidelity. 2. A dual-observer CRDT to keep the ProseMirror and markdown state in-sync. The CRDT + git also power a collaborative experience that shows what Agents are doing in the markdown, have undo/redo, and version history. The “Share” and cloud-sync functionality are geared for team collaboration. They feel “no-code” but leverage git/GitHub under the hood, which also means data stays fully private. In that spirit, we made OpenKnowledge open source for anybody who’s curious or who’d like to contribute. We’re actively thinking about plugins/extensibility and what’s next. If you have suggestions or feedback, would love to hear it. https://ift.tt/PIZ5RVW June 25, 2026 at 08:04PM

Wednesday, June 24, 2026

Show HN: Dspyer – self-correcting, optimizable LLM steps for DSPy and LangGraph https://ift.tt/SGoLdvK

Show HN: Dspyer – self-correcting, optimizable LLM steps for DSPy and LangGraph https://ift.tt/2MUlZ5X June 25, 2026 at 01:08AM

Show HN: LookAway, a Mac break reminder that knows when not to interrupt https://ift.tt/KvUVZ7B

Show HN: LookAway, a Mac break reminder that knows when not to interrupt Hello, I'm Kushagra and I am the indie developer behind LookAway (I've posted about it earlier but it has received quite a lot of updates since the last time so I am posting it again). LookAway is a native break reminder for macOS that doesn't interrupt. I built it because I work from home and I spend a lot of time in front of my screens. It's very easy for me to get lost in the flow and I can end up sitting for hours. Due to this, I started facing issues like eye strain and back pain by the end of the day. The solution to this was simply taking enough breaks throughout the day. But remembering to take breaks was difficult, especially when I was in the flow. I tried some reminder apps but the problem with those was that they always interrupted me at the worst moments. So I ended up not using them. LookAway is designed not to interrupt. It gives enough heads up before a break so that you're not caught off-guard. It's also context-aware and it automatically pauses when you go into a meeting, start watching a video, record screen, and much more. It even waits for you to finish typing or dictating when a break is due. One thing worth mentioning is the free iOS counterpart LookAway Mirror. When your Mac goes on a break, your iOS devices can also mirror the same break so you don't end up scrolling your phone screen during the Mac break. I've spent a lot of time in making LookAway the least annoying break reminder app and I would love to know your thoughts. It's a native Swift app so it doesn't take much resources (150MB RAM and <1% CPU when idle). It's available to download from the website (lookaway.com), Setapp, and the App Store. Thank you! https://lookaway.com June 24, 2026 at 05:29PM

Tuesday, June 23, 2026

Show HN: The Cascade Graph – An interactive map of AI and energy constraints https://ift.tt/g8anMjy

Show HN: The Cascade Graph – An interactive map of AI and energy constraints Hello, I wanted to share with you all a interactive map of the economics and physics constraints of the AI buildout. It has macro drivers, industrial chokepoints, and where that shows up in markets. I've added 393 nodes and 562 edges to capture other supply / physics constraints as well. There's no sign up, and no pay wall, it's all free. Please let me know what you think! https://ift.tt/Fh5fwQC June 23, 2026 at 07:22PM

Show HN: Wordit – Change One Letter, Keep the Chain Going https://ift.tt/nzSy6og

Show HN: Wordit – Change One Letter, Keep the Chain Going Hi everyone, I got this idea for a game where, starting from a four letter word you need to go as deep as you can in your vocabulary, changing only one letter per word. bear -> beer -> peer... Each correct word gives you 1 point Each incorrect word takes one life away from you, you start with 3 https://ift.tt/6ESAcGd June 23, 2026 at 10:57PM

Monday, June 22, 2026

Show HN: I scanned every YC Spring 2026 startup for what AI crawlers see https://ift.tt/JP9zwBn

Show HN: I scanned every YC Spring 2026 startup for what AI crawlers see Used 'potatometer.com' to scan and analyze all All 197 YC Spring 2026 startups on their SEO / GEO / AEO technical setup. I scanned the URL each startup lists in YC's directory. Most are readable by AI crawlers. Most don't tell a crawler what they are. Read more in the blog above. https://ift.tt/VL8XvbS June 23, 2026 at 06:40AM

Show HN: Durable Agent Sessions API (Preview) https://ift.tt/cs6n8Ih

Show HN: Durable Agent Sessions API (Preview) https://ift.tt/AEnCNts June 23, 2026 at 05:37AM

Show HN: Kitcat 2.0 – A Matplotlib back end for terminal plotting https://ift.tt/ZGenzWi

Show HN: Kitcat 2.0 – A Matplotlib back end for terminal plotting https://ift.tt/BPesDTv June 22, 2026 at 09:30PM

Sunday, June 21, 2026

Show HN: Pure Effect – Reproduce production bugs on your laptop without a DB https://ift.tt/7QjbnyZ

Show HN: Pure Effect – Reproduce production bugs on your laptop without a DB Hi HN, I think it's safe to say that the majority of developers don't give a second thought to writing code with I/O tangled in business logic. It's all too common to see code like: const user = findUser(email); if (!user) await saveUser(user); Now, you may ask: what's the big deal? When we write code like this, two things happen: 1. It gets harder to debug production bugs. Unless you have the exact same database and remote API services to connect to, you may fail to reproduce the bug. 2. You have to use mocks and fakes in your tests, or use test containers, which only help somewhat, and they are slow! To solve these issues, I built Pure Effect, a tiny TypeScript/JavaScript effect library. The core idea is simple: if a function performs I/O, it isn't pure. But if it returns a description of the I/O it wants to perform, it is. So instead of await findUser(email), you return a Command object that says, "I would like to call this function, and when it finishes, here's what to do next." Your business logic becomes a pure function. Same input, same output, every time. The database never gets touched until the interpreter (runEffect) runs. When I first started the library, I didn't expect just how far that one idea would stretch. Once your pipelines are just data, a lot of wonderful things become possible: - No need for mocking libraries. You walk the tree in tests and assert on its structure: assert.equal(flow.cmd.name, 'cmdFindUser'). Nothing is executed. - Wrap any effect with Retry(effect, { attempts: 3, delay: 200, backoff: 2 }). The configuration is plain data, so you can assert on it in tests. - Every command's input and output flows through the interpreter, so you get a full execution trace for free. You can write a simple timeTravel() function that replays it locally without touching any I/O. Perfect for debugging complex production bugs. - An onBeforeCommand hook sits between your business logic and the interpreter. Since it sees every intended side effect before it fires, it can be used to enforce runtime guardrails. You can quarantine destructive calls before they happen for example. - You can review AI-generated code before it runs. Since Pure Effect pipelines are plain data, you can inspect what the generated code intends to do before it touches anything. There are just six primitives: Success, Failure, Command, Ask, Retry, and Parallel, plus effectPipe and runEffect. Zero dependencies. Under 1 KB minified and gzipped. How it compares to Effect-TS Effect-TS is the full-featured option in this space and has a large ecosystem. Pure Effect offers a different tradeoff. It covers the 80% case: testable pipelines, dependency injection, retry, and OpenTelemetry hooks, all in under 1 KB with zero dependencies and no new vocabulary to learn. Effect-TS is a framework you build around. Pure Effect, on the other hand, is a pattern you drop into existing code. I've been using Pure Effect in production since December. It's at v0.8.0, not 1.0 yet, but stable enough that I wanted to put it out there and hear what people think. GitHub: https://ift.tt/NaxgqJ7 I wrote five posts that document how Pure Effect evolved. They are tagged at https://ift.tt/JVM5GWh if you want the longer story. https://pure-effect.org June 21, 2026 at 09:36PM

Show HN: DebugBrief – turn debugging sessions into reports, no AI https://ift.tt/9gQY80R

Show HN: DebugBrief – turn debugging sessions into reports, no AI https://ift.tt/1IO7qZH June 21, 2026 at 11:57PM

Show HN: CleverCrow: give tokens to your favorite projects https://ift.tt/PrkhETB

Show HN: CleverCrow: give tokens to your favorite projects Howdy all. I'm Zack :wave:. I've been thinking about the problem of misguided AI pull requests and figured I'd throw a possible solution out there for feedback. Basically, CleverCrow lets supporters give tokens to a GitHub repo (or set of issues in that repo) for the maintainers to use to build/fix stuff. The fun implementation challenges have been around implementing the pooling dynamics and keeping the maintainers in charge while the backers are motivated to support their work. https://clevercrow.io June 21, 2026 at 11:06PM

Saturday, June 20, 2026

Show HN: An ASCII 3D Rendering Engine https://ift.tt/e7sQkTL

Show HN: An ASCII 3D Rendering Engine https://glyphcss.com June 20, 2026 at 11:10PM

Show HN: An n8n alternative where coding agents build the workflows, not humans https://ift.tt/mJWHIT1

Show HN: An n8n alternative where coding agents build the workflows, not humans n8n is built for humans dragging nodes on a canvas. That breaks down at B2B scale (embedding in a product, multi-tenant scalability, etc). n8n does have an MCP server so agents can create workflows too, but it outputs raw JSON. That's fine for n8n's engine, but painful for a coding agent (or a human reviewing its output) to read, write, diff, or debug. I'm building an alternative where workflows are authored by a coding agent in [a more dev-legible format] instead of JSON blobs, and execute it at scale. https://velane.sh/ June 20, 2026 at 10:44PM

Friday, June 19, 2026

Show HN: Jumpjet – a WASM runtime for game developers https://ift.tt/IfVpon2

Show HN: Jumpjet – a WASM runtime for game developers I built Jumpjet because I realized that engine and indie game developers are always repeating the same work: building the core infrastructure that touches the OS. Webassembly solves this in the Component Model by enabling interop between packages written in different languages. And in my opinion it's sort of the perfect fit for Jumpjet's model: providing a chassis without an engine. Jumpjet works by defining a very close mapping of WebGPU (and a few other WebIDL features) to WIT so that they can be used in any language that can target the wasm Component Model. Your game then runs as a guest application in Jumpjet's host runtime (powered by wasmtime), which shrinks final bundle size considerably versus something like Electron. Right now a bare bones game in Jumpjet is about 40mb. Right now the project is in an alpha or possibly pre-alpha state, it's not production ready. On the commercial side, I think there's an opportunity for cloud storage, game server hosting, a package manager and/or marketplace, distribution, and more. Right now you can target macOS, Windows, Linux, Android and iOS. (I haven't done any real testing on mobile so good luck.) The languages you can use will depend on their support for generating bindings from .wit files. There are a few templates available, I recommend one of the Rust ones. If you are a game developer or just like tinkering, I'd love for you to try the project out and tell me what you think! https://jumpjet.dev June 19, 2026 at 11:52PM

Show HN: Continuous Nvidia CUDA PC Sampling Profiler https://ift.tt/zyOb5ft

Show HN: Continuous Nvidia CUDA PC Sampling Profiler Blog post about how we extended our open source profiler to include support for continuous production PC sampling. https://ift.tt/A3NLoyY June 15, 2026 at 07:49PM

Show HN: I built with Claude Code a bookmarks manager with a social cut https://ift.tt/uckSVmF

Show HN: I built with Claude Code a bookmarks manager with a social cut Hi, I am Paolo from Italy. I have spent the last few months designing, developing and testing (my actual job is in the QA domain) a bookmarks manager platform with some help from Claude Code. The idea is to allow users to network with people with similar interests, sharing bookmarks and interesting links, libraries. The tool is free and you can try it out just entering with the guest user (no need to login or register). The guest user is created with demo bookmarks and libraries so you can look at the features. Please if you have some feedback or find this project of any interest, let me know! Ciao :) https://ift.tt/QfrJa6A June 19, 2026 at 11:37PM

Thursday, June 18, 2026

Show HN: I built a daily flag quiz in honor of the World Cup https://ift.tt/vxJC3A6

Show HN: I built a daily flag quiz in honor of the World Cup https://orbisearth.web.app/ June 19, 2026 at 12:15AM

Show HN: Run Agent Skills with mistral.rs v0.8.10: /v1/skills support and more https://ift.tt/ukGMWpU

Show HN: Run Agent Skills with mistral.rs v0.8.10: /v1/skills support and more Hey all! I'm the maintainer of mistral.rs. I just landed support for OpenAI-compatible Agent Skills via a /v1/skills endpoint, and it works with local open models. Until now Skills have basically been locked to closed models, and with the ability to have private, local intelligence becoming increasingly important, but this feature allows you to do XYZ with local models. It's fully compatible with OpenAI's /v1/skills API, so you can drop mistral.rs into your existing code with minimal difficulty. We support the accompanying tools too: /v1/files or input_file for attaching files to your prompts, and mistral.rs also allows models to send generated files back using the OpenAI-compatible method. It's also easier than ever to try mistral.rs: we are including prebuilt binaries for NVIDIA CUDA, Apple Silicon, and CPU! # Linux/Mac > curl --proto '=https' --tlsv1.2 -sSf https://ift.tt/LiT93Zg... | sh # Windows > irm https://ift.tt/LiT93Zg... | iex Then: mistralrs serve --agent --isq 4 -m google/gemma-4-E4B-it Super excited for you to try this out and any feedback! Do you have any suggestions for what you would like to see in the next releases? Check out the GitHub: https://ift.tt/moV2M7B Docs & Quickstart: https://ericlbuehler.github.io/mistral.rs/ June 18, 2026 at 11:03AM

Wednesday, June 17, 2026

Show HN: Chatty Lingo – A language practicing app https://ift.tt/oXJj0hk

Show HN: Chatty Lingo – A language practicing app Hey all, I've built a web app for practicing spoken conversations in a language you're learning. It was driven by my own needs when I learn Korean. The main functionality includes two modes: 1. Chat with AI — you pick a situation (ordering at a café, taking a taxi, checking into a hotel) and the AI plays the other role entirely in your target language. It stays in character, adapts to what you say, and you can fail as many times as you want with nobody watching. 2. Chat with Human — for real, in-person conversations with native speakers (think traveling abroad and talking to a local). It translates the conversation in real time so both sides can actually communicate, and saves the full transcript so you can review it afterward and learn from what was said. I'm a solo dev. The web app is live today and mobile is still being built. The app is still at early stage, I'm adding more features to it. I'd love some feedbacks, especially on what features you think are missing and what you would expect from a language learning app. Here is a demo page where you can play with it without signing up: https://ift.tt/wrhXpea https://ift.tt/8SkdzUv June 18, 2026 at 02:01AM

Show HN: I built a spelling app for kids with my 7-year-old https://ift.tt/kWDZdRB

Show HN: I built a spelling app for kids with my 7-year-old Hello HN! I made an iPad app with my seven-year-old daughter to make learning spelling fun. https://spellabee.com/ We play Spelling Bee type games in our car rides, and she wanted to learn more words that way. So we built a simple app that teaches 10 words at a time, and lets the kids practice and master these 10 words. The full word list in the app is static, and it gets progressively harder as the kid goes through the levels. There are no AI features in the app. I do not collect emails inside the app or have third party trackers. Based on feedback (reviews) and aggregate usage data I plan on updating the app with new word sets. Although the app does not have any AI features I used AI to build the app itself. I used Claude to code the app using Flutter, do etymology research, and understand what alternative apps that are in the App Store. While the LLMs were good at providing a lot of information, I had to synthesize it and play a strong Product Manager role to drive the vision and keep the app simple. My daughter provided a lot of feedback and helped simplify the app and refine the UX. The "Bee Stage" design was inspired by her drawing. Without AI tools, it would have been almost impossible for me to build and launch this app. But it still required a lot of decision-making and prioritization to get the product out of the door. I strongly believe that while AI is a powerful tool, human taste is the differentiator in well made products. If you have a kid in K-5 who is interested in spelling bee type games, give it a try and I would love to hear any feedback you may have as a parent. App store: https://ift.tt/bFTSJyt... https://spellabee.com/ June 18, 2026 at 01:36AM

Show HN: Vpod – Tiny Linux sandbox running in WASM https://ift.tt/m1fCODE

Show HN: Vpod – Tiny Linux sandbox running in WASM Hi HN, I spent the last few months reading the RISC‑V specification to build the lightest possible sandboxes. The idea behind a vpod is to quickly spin up a Linux sandbox from snapshots (Alpine by default) without any setup or subsystem required. The trade-off for portability and security is raw CPU speed. So we don't expect it to match native workloads with Python or pip, for example. More info is in the README https://ift.tt/cD7pyzB Happy to answer any questions! https://ift.tt/cD7pyzB June 17, 2026 at 08:41PM

Tuesday, June 16, 2026

Show HN: Sabela – A Reactive Notebook for Haskell https://ift.tt/uLVGS6X

Show HN: Sabela – A Reactive Notebook for Haskell Sabela is a reactive notebook for Haskell. The name is the Ndebele word for "to respond." Cells respond to each other on change. Initially it was meant as a tool for working with data but it has turned out to have a lot of pedagogical value outside of data analysis work. There is a gallery to read through on the website and a number of examples in the repo showcasing things like: * Python interop * Widgets and animation * Exploratory data analysis If you find any of this interesting please try it out. Any feedback is welcome. https://ift.tt/m4GOkJP June 14, 2026 at 12:33PM

Monday, June 15, 2026

Show HN: Vet turned founder, AI lawn diagnosis https://ift.tt/V6OunUh

Show HN: Vet turned founder, AI lawn diagnosis I know, it's kind of weird. What is a veterinarian doing creating an analysis tool for lawn problems? Frankly, the idea was born of my own lawn care struggles. Endless lawn care company fees without any actual improvement. Googling problems and finding generic solutions without regional considerations. One time I overseeded my grass not realizing I had to actually put soil down too. One day, I decided to run lawn pictures through AI and found some pretty helpful information. So I decided with my clinical background (the idea of treating the cause, not just the symptoms), as well as tech savvy, I would create an AI tool where homeowners can upload pictures of their lawn, enter their ZIP code, and get a diagnosis tailored to their location with actionable next steps in just 15 seconds. Completely free. The platform is monetized with affiliate sales (if a user elects to purchase through one of our Amazon or other links) and by selling exclusive rights to individual ZIP codes to lawn care companies seeking warm leads. Users can pursue their own DIY plan, purchase a lawn care subscription service, or contact a local lawn care system. I'd love if you'd test it out, toy with it, try to break it, and give me your feedback. Any feature requests would be super helpful. Thanks! Excited to hear your thoughts. Andrew https://grassdx.com/ June 15, 2026 at 09:56PM

Show HN: StarScope – Free astronomy dashboard for observers outside the US/UK https://ift.tt/w0pxJh2

Show HN: StarScope – Free astronomy dashboard for observers outside the US/UK https://starscope.live/feed June 15, 2026 at 11:21PM

Show HN: Understand and reduce token usage with ContextSpy context profiler https://ift.tt/1FRASO4

Show HN: Understand and reduce token usage with ContextSpy context profiler https://ift.tt/khdO5GA June 15, 2026 at 11:29PM

Sunday, June 14, 2026

Show HN: I hate typing continue once my CC quota resets https://ift.tt/ogLt876

Show HN: I hate typing continue once my CC quota resets https://ift.tt/4NJnGK9 June 14, 2026 at 11:18PM

Show HN: Solaris the Thinking Ocean Simulator https://ift.tt/aFHymz2

Show HN: Solaris the Thinking Ocean Simulator https://ift.tt/ldPjwZN June 15, 2026 at 01:17AM

Show HN: Ray Hosting – Topology-aware game server orchestrator made from scratch https://ift.tt/wpHj7rm

Show HN: Ray Hosting – Topology-aware game server orchestrator made from scratch Hey HN, I have built a game server orchestrator from scratch, As a solo-dev it took me 3+ years and almost 10 hours daily to finally complete it since i started in the beginning of 2023. Im 26 years old now!. The complexity and stuff i had to research to complete this project i couldnt have imagined them even in my dreams, but hey, here it is, my greatest professional achievement until now. Down below I will try to break down just some of the core and most important features of my game server orchestrator. 1. CORE PINNING & CCD CACHE ALIGNMENT I had to research and understand CPU cache layouts. I found out that if my game containers, which utilize docker run, span across different core complex dies (CCDs) or share SMT sibling threads with a busy neighbor, L3 cache thrashing ruins single-core tick efficiency. Then what I did is that I pinned all non game-server processes strictly on core 0 and its SMT sibling core 12 using GRUB: I disabled the 1000Hz timer interrupts to prevent context switching so as to not pollute the L3 cache. I also offloaded the rcu to cores 0 and 12 so as to avoid any micro interruptions on the game containers and leave 100% of the performance to the game containers. GRUB_CMDLINE_LINUX_DEFAULT="nomodeset isolcpus=1-11,13-23 nohz_full=1-11,13-23 rcu_nocbs=1-11,13-23" As for the game containers, as i mentioned i utilize docker run directly since swarm is not needed and would actually be bad design, I have the orchestrator service which utilizes and algorithm to calculate which CCD core is best to pin the game server container on: // Zen 4 core complex die (CCD) mapping in C# int siblingOffset = totalHardwareThreads / 2; int coresPerCcd = siblingOffset / 2; int getCcdId(int i) => ((i % siblingOffset) < coresPerCcd) ? 0 : 1; int getSibling(int i) => (i < siblingOffset) ? (i + siblingOffset) : (i - siblingOffset); I also set the memory limit and the memory reservation to be equal (--memory == --memory-reservation), in order to make the kernel lock that RAM memory physically RAM and block swap usage to avoid the noisy-neighbour problem. Since, as can be seen, the orchestrator tries to find the most performant threads for a game server, this means that the host node will get its cpu fragmented, specifically for this case I have an algorithm that simulates on the host node the best place for each running game container then relocates some or all of the container dynamically, live, without restarting the container or disconnecting any active player using: docker update --cpuset-cpus="{cpuSet}" {containerName} 2. EBPF/XDP + NFTABLES utilization for preventing ddos attacks, since game servers get constantly bombarded by ddos attacks, bots or otherwise specially targeted for many different reasons, could be whats called a script kid or sometimes even salty gamers, xd. In the beginning i tried to use UFW but ended up get rid of it since it conflicts with docker, which it took me quite some time to realize it in the beginning since i was still doing research on how things work on the network-level. In order to have the best protection I decided to have specific, per port connection rate limits. If the limits are hit I use a blacklist which the offenders ip is registered on, with a specific timer, then immediately register those blacklisted ips on the eBPF map. These IPs are dynamically added and removed from each list/map when the ban expires. There is AnonymousPipeClientStream edge case though, a lot of games have many different mods and plugins which can increase the rate of packets, even though I have tried my best to account for this in the default rate limit rules I have set, also allow the game server owners to actually adjust these limits if needed, cloudflare-style, by providing 4 profiles: Standard, Loose, Strict, UnderAttack. have optimized the standard one as best as I could, based on real life data, and it should be enough for 99% of the servers, the other profiles could be utilized in other rare cases for heavily modded servers for example. So the best approach for ddos mitigation is using nftables with per game server port limits have per game port nftables limits which I have also bumped the rmem_max/wmem_max buffers to 16MB so that specific game-container threads dont block when registering the map data directly into ram, by default the write buffer is tiny around 200 KB, by doing this the player ticks are processed quicker. Since the user needs to manage the game files, uploading/downloading/editing/deleting etc etc, I use fireqos to prioritize game traffic, meaning game traffic gets the fast-lane and is never throttled by the actions that the clients does using their file manager making sure that the game stays ping spike free. I also use TCP BBR Congestion Control instead of the default Linux CUBIC which is unoptimized and causes rubber-banding because it assumes that if there is packet-loss between the game server and the player there must be network congestion which as a result reduces transmission speed, which in turn causes lag spikes. What BBR Congestion Control does is that it measures the actual bandwidth between the game server and the player and sends the data packets at a speed which the player can consume and as a result avoids rubber-banding. I also use fq, fair queueing, in order to avoid a single game server owner from using all the bandwidth in case for example someone decides to upload or download huge files. # BBR Congestion Control net.core.default_qdisc = fq net.ipv4.tcp_congestion_control = bbr # UDP/TCP Buffer Expansion net.core.rmem_max = 16777216 net.core.wmem_max = 16777216 net.core.rmem_default = 16777216 net.core.wmem_default = 16777216 3. SSR CACHE POISON solution. In order to avoid angular ssr cache poisoning i have two endpoints, /graphql - public and read-only data which are directly cached on cloudflare, this endpoint rejects immediately any auth header, by rejecting the entire request, in order to prevent cache-poisoning and prevent any state sharing between requests. The second endpoint is /secure handles any authenticated data and does not cache anything. Also all my web services, like the front end, api, database calls use my private wireguard mesh which adds a layer of security. Also during SSR in Node.js I have skipped the TLS handshakes entirely which adds a bit of latency by using the local Docker swarm network for direct access to my api. ----- Since as I mentioned im a solo-dev, im bootstrapping this entirely out of my own pocket, I have two bare-metal nodes, one in Europe and the other on Central USA. Today, my goal is to see how my orchestrator handles real world usage before i scale up, so I invite anyone to spin up a game server by using my free trials and try to break my system. If anyone wishes, he can go directly on https://ray-hosting.com/en-US/free-trial and register to automatically claim the free trial. It requires a credit card though, solely for abuse protection. OR, if you dont want to put your card down which is understandable, i can spin up a trial for you from my admin panel directly after you register so that you can test my system's abilities, just drop a comment here since I will be watching the thread today. I would really love to hear honest thoughts and opinions on the architecture, deployment speed, or any other thing you want to discuss. PS: im not a native english-speaker so I had a hard time putting this together, lol, btw, I do have a lot more stuff to talk about my platform but for now this drained me. Lol, thank you very much for reading. https://ray-hosting.com/en-US June 15, 2026 at 12:03AM

Show HN: Trace – Offline Mac meeting transcripts you can flag mid-call https://ift.tt/kFJyl6D

Show HN: Trace – Offline Mac meeting transcripts you can flag mid-call I'm the developer of Trace, a non-intrusive, shortcut-driven Mac app that records and transcribes your meetings on-device. I know, another meeting transcription app. Please bear with me though, I'm confident that this is at least a little novel. I primarily built Trace for myself. I'd been using MacWhisper, but there was enough fiddling before each call that I'd forget to start it and walk out of an hour-long meeting with nothing written down. So the things I cared about most were that it's quick to activate and stays out of the way. You activate Trace by pressing a global shortcut (configurable), which reveals a small bar at the bottom of your screen (there's also a keystroke and/or option to hide it entirely if you'd rather not see it at all). As I was building it I wanted to bake in a couple of workflows I'd wished for in other transcription apps. 1. Mid-meeting you can press another global shortcut to mark a "key moment" and type a note. The note shows up in the resulting transcript inline at that timestamp. I wanted to add this because I kept catching myself thinking "wait, that bit matters" in meetings and reaching to jot it down in a separate app like Obsidian, which I then needed to add context to, which took me out of the meeting. I use it all the time. If I paste the transcript into an LLM afterwards (which I find myself doing more and more these days) the important moments are flagged so it doesn't gloss over them. This is more noticeable in longer meetings with lots of topics. 2. With another keyboard shortcut you can summon a rough live recap (subtitles, basically) to quickly recap what's just been said. Trace uses standard macOS microphone and system recording APIs to capture both sides of the conversation as two separate tracks and then runs the system side through on-device diarization to identify speakers. Right now we only label them as "Speaker 1", "Speaker 2", etc but there are plans for speaker labelling in the future. You can also show a "live recap" as the call is happening to review what someone just said. All transcription models run on your machine. To be clear though, Trace doesn't do any of the summarising itself, it just produces a markdown transcript, so if you want summaries then you need to pass the output to an AI. The app is sandboxed and your audio/transcripts are never uploaded anywhere - they just exist as audio files and markdown on disk. The only network call Trace is required to make is on the first run to download the speech and speaker models (around 500MB) from Hugging Face, and after that it can be used fully offline. If enabled, a Google Calendar integration can auto-name sessions but that needs a network connection. The app is £9.99 on the macOS App Store. I've been using it every day for months now and I'm super happy with how it's improved my workflow. Feedback very welcome. https://traceapp.info June 14, 2026 at 12:41AM

Saturday, June 13, 2026

Show HN: Bye-wk – Hide World Cup news from your feed https://ift.tt/AzBavkw

Show HN: Bye-wk – Hide World Cup news from your feed https://ift.tt/V3eUqN7 June 14, 2026 at 01:50AM

Show HN: Galdor – a Go LLM agent framework with built-in tracing and replay https://ift.tt/5OmRs1j

Show HN: Galdor – a Go LLM agent framework with built-in tracing and replay https://ift.tt/LUXSFCB June 13, 2026 at 11:04PM

Friday, June 12, 2026

Show HN: Lead Qualifier – Get leads qualified in minutes https://ift.tt/qFc7WL1

Show HN: Lead Qualifier – Get leads qualified in minutes https://ift.tt/qCuRPrn June 13, 2026 at 06:36AM

Show HN: Turn your name into a tree in an infinite procedural shanshui landscape https://ift.tt/VrAlUiR

Show HN: Turn your name into a tree in an infinite procedural shanshui landscape Hi HN! I made this after collecting hundreds of "name → tree" submissions at ITP. Live: https://ift.tt/dR52mfo Source: https://ift.tt/RANfmhy Plant a tree: https://ift.tt/px9BYD3 Pan and zoom an infinite procedural landscape. Each name is converted to ASCII codes, which grow into a unique tree (breadth-first branching; repeated digits become mathematical roses). Mountains use midpoint displacement + Perlin noise, with SVG radial gradients in the blue/green/gold palette from Wang Ximeng's "A Thousand Li of Rivers and Mountains." Inspired by Lingdong Huang's {Shan, Shui}* ( https://ift.tt/qtHRjPN ). Every tree is someone's name, signed with an APack stamp ( https://ift.tt/fyQ5wc6 ). Try planting your name, then pan along the ridgeline to find it. "My trees" lets you jump back to ones you planted. Happy to answer questions about the terrain algo, name→tree encoding, or the Riso print we tiled at ITP Winter Show! https://ift.tt/dR52mfo June 10, 2026 at 06:39PM

Show HN: Nenya – A lightweight, highly secure AI API Gateway/Proxy written in Go https://ift.tt/6hAdNOx

Show HN: Nenya – A lightweight, highly secure AI API Gateway/Proxy written in Go https://ift.tt/c4KiyLe June 12, 2026 at 09:32PM

Thursday, June 11, 2026

Show HN: Nuts – pip/NPM for Java with first-class workspaces, JDK provisioning https://ift.tt/rC87lut

Show HN: Nuts – pip/NPM for Java with first-class workspaces, JDK provisioning My frustration with distributing java apps didnt show up recently. I remember having implemented my first network jar downloaded back in the 2000's because i needed applet like feature support with desktop full control. Years after, the problem is the very same. Webstart didnt really took off and the only mean i had in my projects was the ugly fatjars, including the (for me) uglier spring-boot repackaging that changes the application classloading behaviour and hence giving me by time some headackes i was not prepared for. So basically nuts started as a response to this frustration 9 years ago, but from now i think its mature enough (used in production) to be shared, and forecebly i am more keen to need suggestions and help from fellow contributors. https://ift.tt/irDh9Hm June 11, 2026 at 02:23AM

Show HN: AVP – an agent can't leak a secret it never had https://ift.tt/YAIBDQT

Show HN: AVP – an agent can't leak a secret it never had A process can't leak a secret it never had. Shai-hulud, prompt-injection - you name it. They cannot steal what your agent (or an process) don't have. I run coding agents (Claude Code, Codex) on my own machines most of the day. Every one of them wants real API keys in env and I was scratching my head for the last few months how to contain it. The usual answer to this is a firewall. I don't buy it. A firewall tries to contain a secret the process is still holding, and the rules are painful to maintain. AVP gives the agent a placeholder and injects the real value at the last moment, on the wire: ``` # the agent's env holds only a placeholder STRIPE_API_KEY=avp-placeholder # agent sends: Authorization: Bearer avp-placeholder # AVP forwards upstream: Authorization: Bearer sk_live_...real... ``` Keep your passwords in your vault where they belong. AVP initially relies on Bitwarden as a secret manager. It's MIT licensed. Appreciate any feedback. https://ift.tt/qQ3GS1P June 11, 2026 at 11:10PM

Wednesday, June 10, 2026

Show HN: Kctx – A read-only Kubernetes context engine for SREs and AI Agents https://ift.tt/REDP9i5

Show HN: Kctx – A read-only Kubernetes context engine for SREs and AI Agents https://ift.tt/WBpnHPO June 11, 2026 at 12:44AM

Show HN: Atlasphere – Live Infrastructure Diagrams https://ift.tt/wvmNBWq

Show HN: Atlasphere – Live Infrastructure Diagrams Hi HN. My name is Andrey. On a regular business day, I'm a software engineer working at AWS. Outside of work hours, I spend time on my hobby - writing code. I was once building a pet project that allowed customers to spin up fully synchronized blockchain nodes within just a few minutes. The backend was split into a control plane and a data plane, each with its own AWS account. Later I added two more AWS accounts. One for shared RPC nodes. One for the Analytics Service. Since I love to visualize things, I used drawio to visualize the architecture. With time, I noticed a pattern. I'd write some code, add a few lambda functions, update my drawio diagram, write more code, introduce a few more resources, test things, see that everything works fine and go to sleep with a smile on my face. Next week I'd check my diagram, and shockingly, it's missing some of the resources! This kept happening for a few more weeks until I decided to fully abandon the project until my infrastructure diagrams could stay in sync with my cloud account. That's how Atlasphere.io was born. I've been working on it for the past 6 months and I think the product is ready for some feedback :) A few notes: - Atlasphere uses a ReadOnly IAM role to scan your AWS account (my account reaches your account through a trust relationship). - The number of services is currently limited (WIP) - It's a macOS app - It's NOT an Electron app, i use Rust + Webview What am I looking for? All I really need is for someone to try the app and tell me what they like about it and what they absolutely hate about it, haha! The website is https://atlasphere.io/ June 9, 2026 at 04:35PM

Tuesday, June 9, 2026

Show HN: LocalCode – turn plain English into CLI commands with Apple's local AI https://ift.tt/qWojEPG

Show HN: LocalCode – turn plain English into CLI commands with Apple's local AI https://ift.tt/iWHLkTB June 10, 2026 at 01:04AM

Show HN: OpenYabby, voice-controlled multi-agent orchestrator for Claude Code https://ift.tt/i85B9bz

Show HN: OpenYabby, voice-controlled multi-agent orchestrator for Claude Code https://ift.tt/TnYFQ78 June 10, 2026 at 12:08AM

Show HN: Transit-format (JSON/MessagePack) reader/writer in C https://ift.tt/fNoRyJT

Show HN: Transit-format (JSON/MessagePack) reader/writer in C Transit.c is an addition to the set of libraries to support transit data interchange format written in C11. It supports full 0.8 specification of cognitect's transit-format: JSON, JSON-Verbose and MessagePack encodings, all ground and extension types, compression via keys caching, extensibility via custom tag handlers. https://ift.tt/l8Nmpd9 June 8, 2026 at 01:35PM

Monday, June 8, 2026

Show HN: Courtside – TUI for NBA Games https://ift.tt/4HQE1ZL

Show HN: Courtside – TUI for NBA Games Hi HN, I made this after seeing a few similar projects on the front page. NBA API endpoints are public and there’s a pretty robust python package ( https://ift.tt/RDcyEqB ) that I referenced for the endpoint structure to build an sdk in go. used BubbleTea and LipGloss for styling. It was a bit tricky to test the live endpoints but I watched Friday’s Final game with this and it worked pretty well playball - https://ift.tt/3M7J9Wx faceoff - https://ift.tt/1gb8zWN https://ift.tt/mIUnsu6 June 6, 2026 at 03:43AM

Show HN: HTTP/3 and raw QUIC client/server APIs for Node.js https://ift.tt/XdSvu73

Show HN: HTTP/3 and raw QUIC client/server APIs for Node.js I built this because I wanted to make outbound and accept inbound HTTP/3 and raw QUIC connections from ordinary Node.js code, without building Node from source or putting everything behind a reverse proxy. Repo: https://ift.tt/y4Xqm6J npm: https://ift.tt/qWKaOo3 It’s a native package around Rust/quiche. It supports both client and server APIs, I'm using it in a couple of projects: creating raw QUIC streams, datagrams, custom ALPN, session behavior, and HTTP/3 client work from Node. I've tried to be very safe in the native code, written in rust, with proofs around the parts I was most concerned about getting wrong. I have it hosting a couple of sites as HTTP3 endpoints and found it working well. https://ift.tt/y4Xqm6J June 8, 2026 at 10:38PM

Show HN: Stop returning raw JSON from MCP servers, build rich inline UIs https://ift.tt/1NwO6VF

Show HN: Stop returning raw JSON from MCP servers, build rich inline UIs https://ift.tt/9OjrnbE June 8, 2026 at 10:37PM

Sunday, June 7, 2026

Show HN: An mkv player that uses WASM to render you videos https://ift.tt/GjN3YSq

Show HN: An mkv player that uses WASM to render you videos hello HN i want to share this wasm experience i built for a universal mkv player on the web using wasm to ship a lean decoder only ffmpeg build, thus way codecs unsupported by the browser can be played I wonder if this holds any value to anyone anymore https://parallax.kinosoft.moe/ June 8, 2026 at 03:57AM

Show HN I scraped 743 large employers' careers pages to find their ATS https://ift.tt/eUTns5l

Show HN I scraped 743 large employers' careers pages to find their ATS https://ift.tt/ZCsNkLM June 7, 2026 at 09:45PM

Show HN: I Derived a Pancake https://ift.tt/IPwv4UH

Show HN: I Derived a Pancake After 25 years of making other people's pancake recipes - always yearning for more tang, more fluff, and more predictability - I decided to derive the pancake recipe from the chemistry. You mark checkboxes for what you have on hand (ricotta, sour cream, kefir, buttermilk, yogurt, cottage cheese, lemon, cream of tartar, etc.) and it computes the best recipe based on targets for acid, fat, salt, sugar, and CO2. My particular favorite are the yeast-raised lemon ricotta kefir pancakes - the best I've ever had. The math is done in a small pure-ESM library: ingredient composition to component masses and acid moles, a stoichiometry layer, and a bisection solver for the target deficits. I'm not a chemist, so if something is off, tell me and I will fix it! https://ift.tt/HM12ac9 June 5, 2026 at 10:42AM

Show HN: Nightwatch, The open-source, read-only AI SRE https://ift.tt/ZkShXja

Show HN: Nightwatch, The open-source, read-only AI SRE nightwatch is a local-first, read-only layer on top of your monitoring. it groups alert storm into incidents, flags noisy checks and has an agent that can investigate for you live systems. You can e.g. jump from the incident into the agent directly. the reason for this weekend project is that we had a kubernetes upgrade that went wrong, and at some point a rollback wasn't possible anymore, so it had to be fixed live during the night while several problems came together. We run a lot of different systems, on-prem and several Kubernetes clusters, and in a situation like that you spend most of the time just figuring out what is actually broken and where. So i thought that it would be pretty cool to have eyes in the dark in each system that can talk to your "brain". so the idea is to put a baby owl into each environment. Each owl runs where the systems live, keeps that environment's credentials local, and only dials outbound to a central brain, so there is no inbound hole into prod. It exposes a set of read-only skills, and the agent uses them to gather evidence and form a root-cause hypothesis, so the on-call engineer starts with a head start instead of from zero. read-only for now, i don't trust it near prod yet and honestly neither should you. llocal-first for easy self-hosting and to keep credentials on your side. the clustering and recommendations run fully offline with no llm at all. the agent needs a tool-calling llm, you can point it at a remote one, or self-host one (ollama etc.) if you want to stay fully offline. for non selfhosters: before every remote llm call, nightwatch strips real secrets (unrestorable) and swaps identifiers like ips, hostnames and paths for reversible placeholders, so the model only sees masked data while real values are restored only in the proposed commands and tool calls Would love if you try it in your Systems https://ift.tt/R4n2PYk June 8, 2026 at 12:24AM

Saturday, June 6, 2026

Show HN: Dap-mux – Connect your editor and REPL to the same debug session https://ift.tt/xNLWFYM

Show HN: Dap-mux – Connect your editor and REPL to the same debug session I have been coding over four decades, in many languages, on many projects (including Firefox, Final Cut Pro, the Newton, and Fullwrite Professional if you can remember that far back; all these using my "dead-name"). I wrote something small and simple to scratch an itch. It's the UNIX philosophy: small "one-trick ponies", each *really* good at their one trick, then the user can hook them together to solve actual problems. I'm a CLI guy, and for almost everything, I already have this. But not for debugging. The itch I scratched was the connector that enables this philosophy for debugging. That thing is dap-mux. A DAP multiplexer turning a one-to-one protocol into a cooperating session of as many tools as you need to get it done! How it started: Helix and Python for me (and sometimes IPython), with the rest of my team using PyCharm (which I have long loved!). My team's problem is that they want the PyCharm debugger, and so must be satisfied with the JetBrains editor. *My* problem was I could use a full-blown debugger *or* I could have IPython *or* I could have Helix (or sometimes an unsatisfying combination of Helix and the debugger). That was my "itch". DAP (Debug Adapter Protocol) is the tantalizing answer, except it isn't. DAP is what editors (that don't want to write their own debuggers) are starting to adopt. The problem with DAP is it's one-to-one. One editor connects to one debugger. Done. Not a solution to my problem. And then suddenly, it *was* the solution. I realized that a DAP multiplexer would let you connect any DAP-aware editor to any debugger for any language, and simultaneously to a REPL, another session of your editor (or a different editor)! With the side benefit that now, like screen or tmux, since each process is its own thing: sessions are durable. Just restart whatever crashed and you're back where you were! There were hard parts: sequencing, late joiners, state management. Different end-points working on different actions in different sequences but with the same message ids. I solved these problems something like how NAT works. Instead of translating network addresses, though, I'm translating the sequence numbers of each client into something global and ordered, then correctly routing replies back to the end-point awaiting them, while mapping the sequence numbers for those replies back into the space of that end-point. Knowing the current state of the debugger, and replaying that as a message sequence to late joiners lets you start/connect the clients in any order. I chose Python: asyncio fits the I/O-router pattern perfectly, and it lets the IPython extension run in-process rather than over IPC. There are problems not yet solved: for instance, I think configuration in the clients and/or the startup sequence is too complicated. But it functions! I got what I wanted! The combination I use every day: Python + debugpy + Helix + IPython, all connected simultaneously. Step with `%n` or `%s`, evaluate expressions with `%eval`, watch Helix track the current line in real time. Rust with codelldb is the second confirmed combination — I debugged a Dijkstra implementation with Helix and a third-party DAP observer tool both connected to the same codelldb session. A community member, Sean Perry, has already built [dap-observer]( https://ift.tt/psEKtxv ), which renders the current frame's variables as a navigable terminal tree. *This* was my exact dream! Small, focused, connectable tools all playing together! There's so much left to try: other editors, other debug adapters, Windows, other languages. None of this has been touched yet. The most helpful thing now is people testing it with their own setup and reporting what they find. It's time to play! `uv tool install 'dap-mux[ipython]'` for Python + IPython. `uv tool install dap-mux` for headless use with any language and adapter. No need for any part of the Python ecosystem. https://ift.tt/UEmyBv8 June 7, 2026 at 01:13AM

Show HN: I ported Xonotic (arena FPS) to WebAssembly with full P2P multiplayer https://ift.tt/bJ1xesK

Show HN: I ported Xonotic (arena FPS) to WebAssembly with full P2P multiplayer https://dpgame.xonotic.workers.dev/ June 6, 2026 at 11:29PM

Show HN: Typedframes – Pandas/polars column name checking at lint time https://ift.tt/L3j7NV6

Show HN: Typedframes – Pandas/polars column name checking at lint time https://ift.tt/xMUuwnX June 7, 2026 at 12:32AM

Friday, June 5, 2026

Show HN: I nerfed our coding agents on purpose https://ift.tt/DugVRFl

Show HN: I nerfed our coding agents on purpose Tl;dr: I trained a classifier to route to the least expensive model and reasoning depth to complete the request. Coupling that with additional automated token efficiency techniques has yielded 3x usage for the same spend. For anyone interested in trying it themselves: https://nerfguard.com Various teammates and I switched over to Codex from Claude Code recently. We still bounce between the tools, but Codex’s speed and steerability coupled with performance gains were hard to ignore. One of the downsides was that the per token pricing kicked in way sooner. This is happening across the board, but we felt it in Codex more acutely. We’re a startup filled with people who work around the clock and are obsessed with building — naturally our daily bill alone was striking. Luckily we’re going after a big mission and speed matters significantly more than marginal token spend on the edges. Still, it got us thinking about how it was ludicrous that while our product has a side effect of decreasing token spend and speeding up agentic workflows by many orders of magnitude, we were using these top tier models for all types of internal coding tasks without any of those optimizations. The waste felt pretty ridiculous — the most glaring culprit was that we were seemingly using the max intelligence model on max reasoning for every task even when the task clearly didn’t require it. As a company who spends a lot of time on cached intelligence, it was also easy for us to see how there was plenty of other low hanging fruit as well. So, on a recent weekend, I quickly built a tool to optimize our usage. At its core is a very fast classifier that classifies your requests to the least intelligence required for the task and includes some nice token optimizations on top. The result is roughly the same quality for multiples lower token spend. But even more exciting for us, is that the properly bin packed intelligence and reasoning levels meant our speed also went up considerably. This wasn’t negligible. We’ve observed up to 3x savings and hours per day per person in saved time that we would have otherwise been waiting on tool turns and coding agent responses. For us, that means improved engineering velocity and significantly higher usage for the same spend. It also means more usage before getting throttled. As I told friends about this, they also wanted to start using it to maximize the usage they could get out of their coding agent plans. There are now engineers across many of the most cutting edge AI companies using this tool to optimize their token utilization in this way. Not just to save money, but to maximize output. Turns out that the best way to avoid getting nerfed by Claude is to intentionally nerf yourself selectively. We decided to release it for the rest of the builder community to use as well. You can now turn on Nerfguard for yourself and start getting more usage today. June 6, 2026 at 03:19AM

Show HN: OWASP VulnerableApp Modern Extensible and Scalable vulnerable app https://ift.tt/MGH63uV

Show HN: OWASP VulnerableApp Modern Extensible and Scalable vulnerable app https://ift.tt/ONXVEiL June 6, 2026 at 12:19AM

Show HN: I rebuilt a tiny old volleyball game I loved https://ift.tt/TzkM0Zx

Show HN: I rebuilt a tiny old volleyball game I loved https://volleyhop.com/ June 6, 2026 at 12:12AM

Show HN: Bash Runtime for AWS Lambda https://ift.tt/s8WHJA7

Show HN: Bash Runtime for AWS Lambda Hi HN, I built a Bash runtime for AWS Lambda to make writing glue code simpler and faster. Sometimes, all you need is a bit of `sed`, `awk`, maybe a loop and a few HTTP API calls, and this runtime gives you all the tools to do that. It comes bundled with `jq` and `curl` so you can handle JSON payloads and string together HTTP API calls right out of the box, including calling AWS services with `curl --aws-sigv4`. In keeping with the theme, the Lambda handler contract is also made as simple as practical: read from stdin, write to stdout, return 0 for success and non-0 for error. You can run shell scripts, call binaries (either what's available in `al2023.provided` or you can package your own static binaries with your handler), or a combination of both. If you remember nodding along to Adam Drake's post about how bash and coreutils can be faster than a Hadoop cluster, I hope you give this a whirl and find it useful. The runtime is packaged as a Lambda layer, so it should drop right into your normal AWS infrastructure. https://ift.tt/4Kk5Ryx June 5, 2026 at 11:12PM

Thursday, June 4, 2026

Show HN: Digger Solo – Local AI File Explorer https://ift.tt/o1ZqRJH

Show HN: Digger Solo – Local AI File Explorer After a lot of work I present Digger Solo 0.5.0 - the AI file explorer that respects your privacy (everything runs locally). Demo video: https://ift.tt/3W9VH04 New features: - LLM Chat with RAG (bring your own OpenAI compatible API key - ideally host a local model) - fresh redesign with light theme available in settings - multi-tabbed GUI: open multiple semantic maps at once - smart music player: auto-plays similar songs Digger Solo offers semantic search and maps that allow you to browse your files intuively - uncovering hidden connections and near duplicates easily. Happy to answer questions. https://solo.digger.lol June 4, 2026 at 11:25PM

Show HN: Bot or Not – Spot AI-generated randomness https://ift.tt/tNSDq9A

Show HN: Bot or Not – Spot AI-generated randomness https://play-bot-or-not.vercel.app/ June 4, 2026 at 11:56PM

Wednesday, June 3, 2026

Show HN: Fork of Rsync https://ift.tt/J0mfCWZ

Show HN: Fork of Rsync Hello. After hearing of the problematic LLM commits in rsync, I made a fork of rsync. I decided to fork it off release 3.4.1, since I heard that's the last release without the LLM code. https://ift.tt/gpsnQ6m June 4, 2026 at 02:20AM

Show HN: Lint Your Markdown with ESLint https://ift.tt/P83scE1

Show HN: Lint Your Markdown with ESLint https://ift.tt/w0yLMf1 June 3, 2026 at 05:47PM

Tuesday, June 2, 2026

Show HN: Live breath detection and biofeedback from a phone microphone https://ift.tt/MvVx8kI

Show HN: Live breath detection and biofeedback from a phone microphone Hi everyone, I am Felix, a famliy doctor from ZH, Switzerland. A couple of month ago I started this little project called shii • haa, a breathing app that uses the phone`s microphone for live biofeedback My prior work in emergency medicine and intensive care was closesly linked to breathing, mostly in critical situations... and let me to reevaluate my own way of breathing. over time one question popped into my mind: can medical knowledge and biofeedback make an app actually promote self-awareness instead of attaching your goals to the award system of the app. it combines signal processing, a breathing state machine and ML. The state machine follows inhale, exhale and transitions in the mic signal. A quality layer rejects noisy or ambiguous windows before signals are used for feedback. All processing is done on-device, no speech or raw audio is uploaded. What I'm trying to avoid is turning breathing into another score or game. The app gives feedback on rhythm, depth and regularity, but the point is more "notice what you are doing" than "perform well". I'd be interested in feedback, especially from people who have worked on signal processing, health UX, or Android/iOS audio issues. https://ift.tt/1qK6Xy9 June 2, 2026 at 08:02PM

Show HN: 100cc - Roll your own Claude in 100 lines https://ift.tt/8mzA2Sd

Show HN: 100cc - Roll your own Claude in 100 lines https://ift.tt/7XgdfLl June 2, 2026 at 10:35PM

Monday, June 1, 2026

Show HN: A free Linux adaptation of NETworkManager by BornToBeRoot https://ift.tt/octAkn5

Show HN: A free Linux adaptation of NETworkManager by BornToBeRoot https://ift.tt/DrwO1jz May 30, 2026 at 08:40PM

Show HN: Trumpstonks – every company Trump's named, backtested vs. the S&P https://ift.tt/EzYp02T

Show HN: Trumpstonks – every company Trump's named, backtested vs. the S&P https://ift.tt/xskVOPF June 1, 2026 at 09:30PM

Sunday, May 31, 2026

Saturday, May 30, 2026

Show HN: leaf – one month later: website, releases and lots of improvements https://ift.tt/4j3X2fu

Show HN: leaf – one month later: website, releases and lots of improvements Hi HN, About a month ago, I shared leaf here while it was still in its early stages. Since then, the project has shipped multiple releases, with UX improvements, bug fixes, and a documentation website now available. leaf is a terminal-based Markdown reader focused on a GUI-like experience, with navigation, search, table of contents, clickable links, syntax highlighting, editor integration, LaTeX rendering, Mermaid diagrams, and more. It works on Linux, macOS, Windows, and Termux. GitHub: https://ift.tt/gLGFjBd Thanks to all contributors and everyone who starred the project for their support, and feedback on UX, performance with large files, and missing features is still very welcome. https://ift.tt/UfL3gZv May 30, 2026 at 11:09PM

Show HN: UN Condemnation Statistics https://ift.tt/sgYuAH4

Show HN: UN Condemnation Statistics https://boxed.github.io/UN-condemns/ May 30, 2026 at 08:57PM

Friday, May 29, 2026

Show HN: AionOS – self-healing microkernel in Zig (boots on real hardware) https://ift.tt/eC9qwf8

Show HN: AionOS – self-healing microkernel in Zig (boots on real hardware) https://ift.tt/uH1CykG May 29, 2026 at 09:42PM

Show HN: Vibewarz – bot vs bot arena for vibecoders https://ift.tt/jmieRlq

Show HN: Vibewarz – bot vs bot arena for vibecoders https://vibewarz.com May 29, 2026 at 08:48PM

Thursday, May 28, 2026

Show HN: htop for the airwaves — a live 802.11 RF dashboard in your terminal https://ift.tt/jdnvFu3

Show HN: htop for the airwaves — a live 802.11 RF dashboard in your terminal https://ift.tt/wbHNC2M May 29, 2026 at 05:51AM

Show HN: Bootstrap a team of coding agents from a template, OSS https://ift.tt/cLWfAN6

Show HN: Bootstrap a team of coding agents from a template, OSS I have spent the last few months working on infrastructure and tools to give agents global ids, and the ability to communicate. That is up and running now, but actually structuring their work together has been a real pain: I still have to give them roles and responsibilities, and start the agents in the right directories with the right id so that the actually get things done. I have automated that part now: a team can be bootstrapped from a template with one command: aw team bootstrap https://ift.tt/wcPHUr0 \ --username \ --work-directory /path/to/your/repo So far I have published three templates that I find useful (linked from the submitted github page). I am looking for feedback, please let me know what you think and how to make it more useful. One direction I am considering is to replace the markdown-based structure with an actual ontology. https://ift.tt/oLA6l0n May 28, 2026 at 10:30PM

Show HN: Scrolodex – A super simple window switcher for macOS https://ift.tt/nOg42vG

Show HN: Scrolodex – A super simple window switcher for macOS I built scrolodex to scratch my own itch of having a quick and simple way to switch between the currently open windows under my cursor. Simply hold ⌥ + scroll to cycle through windows under your cursor. Release to focus. Also includes triggers for scrolling through all windows, dock app's windows, or switching between desktop spaces. Configurable hotkeys, themes and overlays. Completely free and OSS. brew install --cask jaydenfyi/tap/scrolodex Website/demo: https://scrolodex.app/ GitHub: https://ift.tt/d34nAlp https://scrolodex.app/ May 29, 2026 at 12:02AM

Wednesday, May 27, 2026

Show HN: An update to our long-turn FreeCiv experience https://ift.tt/Ik3ahmf

Show HN: An update to our long-turn FreeCiv experience So we have had quiet the journey here. So 70 days ago (aka 73 turns ago) I posted on HN sharing our FreeCiv deployment ( https://ift.tt/WQuf7cn ). FreeCiv is a great game, the clients is very buggy however. I'm using the GTK4 version, but a few others have opted for the QT variant. At some point, we might turn our focus to contributing to improving the client based on our experiences playing the game. We've since added a lot of little fun features: - The editor: you can write to the newspaper Editor and they /might/ publish what you write, quote you, or decide you're full of it and write an opinion piece slamming your reputation. The editor will also reach out to a few players, each turn, and ask for their input on current game matters. - The Intelligence Dashboard. People were forgetting what they were up to, so we added a dashboard showing the timeline of what happens per turn for your player. - beta the online map viewer: I wanted a way to view the map without loading the client, so we started working on a beta map viewer that is HTML based. - The Chronicle (The newspaper) has also grown a bit. Maybe too much? We'll see. The crossword is fun. Some other 'fun' things that happened: my brother in law stopped speaking to me because of in game banter that was taken way too seriously. My friends invaded my wifes territory, and well, she didn't like that either. I'm currently in the lead, but theres still a long way to go from 475BC. https://ift.tt/Av4HdtX May 28, 2026 at 04:24AM

Show HN: Open-Source AI Racing Harness https://ift.tt/cVshwrI

Show HN: Open-Source AI Racing Harness Hi I'm Dan from Elodin, making an open source real-time capable flight software simulation. For AI Grand Prix contestants, the wait for the Round 1 virtual qualifier simulation has been grueling. If you’re competing, check out our simulation harness to tide you over, built to match the published competition constraints and message format. It runs against real Betaflight, which we learned requires at least 1000 sensor samples per second to run real-time correctly. The competition warranted introducing a new feature to generate the camera sensor directly in the simulation loop. Typically people connect to Unreal or similar game engine to create a camera sensor, which works well but is very heavy. For the simple needs of this challenge, creating sample directly in the loop is very handy and easy to use. Happy to hear your feedback on this! While it's not fancy looking currently, it uses the Rust Bevy game engine, which should allow us to improve the visual fidelity quickly. We all should easily be able to shift our implementation to the published competition sim once it lands. Hope you enjoy and good luck! https://ift.tt/q3RnL6A May 28, 2026 at 12:37AM

Tuesday, May 26, 2026

Show HN: DDoS detection in 500 lines of Python (MIT, no cloud, no account) https://ift.tt/cakVn6m

Show HN: DDoS detection in 500 lines of Python (MIT, no cloud, no account) https://ift.tt/xohpZz9 May 27, 2026 at 12:39AM

Show HN: I used AI to built static recompilers for 5 retro game systems https://ift.tt/OIMpN0w

Show HN: I used AI to built static recompilers for 5 retro game systems https://1379.tech/nes-snes-genesis-virtualboy-and-psx-a-journey-with-ai-and-recompilation/ May 26, 2026 at 09:38PM

Monday, May 25, 2026

Show HN: TryPost – open-source Social Media Scheduler https://ift.tt/WL4xHgC

Show HN: TryPost – open-source Social Media Scheduler https://trypost.it/en May 26, 2026 at 12:25AM

Show HN: Write your BPF programs in Go, not C https://ift.tt/wPj5eNI

Show HN: Write your BPF programs in Go, not C https://ift.tt/HIGTJQr May 21, 2026 at 08:25PM

Show HN: I made Pokémon but with real animals in the real world https://ift.tt/XtGNbDr

Show HN: I made Pokémon but with real animals in the real world Firstly, apologies, it's not free. It would be difficult to support this for free, it's a paid game. I will now share the technical details, which will probably be most of interest for HN readers. I previously made a carbon footprint tracking app where you photo objects and it tells you the carbon footprint by using an LLM to estimate the data on the fly, e.g. 32kg CO2e / kg of beef, in the UK. At some point, I realised that it is possible to make a Pokémon-style game, but capturing real animals in the real world. This is now possible because: - image recognition is cheap, i.e. identifying animals, and the models (gpt-4o) can detect a (surprisingly) large number of animals and output their exact species. - LLMs can output a species' full taxonomy, pretty reliably. And, more importantly, they can generate game data quickly, on the fly. It would unfeasible to generate the game sprites (images) for every species (millions, worldwide) and their full evolution chain, e.g. caterpillar, chrysalis, butterfly, ahead of time. I realised it's possible to do this in real time. General game flow: - photo animal - send to gpt-4o - return species - send species to LLM, create evolution chain, plus attributes, types and moves. - in parallel, create sprites. All data is cached. The aim of the game is to build up your team and compete with other players to take over gyms. The game is based in the real world, I had to come up with a way to have health centres and shops. These must both have decent coverage, globally. The solution is health centres are places of worship, e.g. churches, mosques, temples etc and shops are real world grocery stores. Every country as far as I can tell has places of worship, with good distribution, which was surprising. Gyms are located in every park worldwide. Challenges: How to get players outside: - I use openstreetmap for the game map, but I overlay my game design on top of it. - To physically make players go out into nature: I use openstreetmap area types to only allow capturing animals when your GPS location is in natural areas, e.g. woodland, parks etc. The aim of the game is to get you out into nature and appreciating animals. - Level system: The solution I came up with is to set the animal levels based on the proximity to built-up areas, e.g. Every ~500 meters you go away from built-up areas, the animal level bands increase by 5 levels. - It would be expensive to render the entire physical world in my game map, so I instead render the map on the fly, deterministically. I also fetch animal calls in real time so that when they enter battle you hear a pigeon cooing, for example, which is pretty cool. I also fetch the animals conservation status, i.e. how endangered is it, and give you more reward (leaves, in-game currency) for capturing rarer animals. I "launched" the game about a month ago, but have not really been publicising it as I've been working on various updates and improvements, but now I am sharing it more openly. It's got about 20 players so far, from around the world, and around 500 unique animal species have already been encountered. Challenges have been keeping the costs low. Servers cost about $200 / month, text-gen is basically free as I get free tokens from OpenAI for sharing data, it's not privacy-related, and image-gen costs about $0.04 per sprite (2 per animal). My background: not a programmer, originally a mechanical engineer and then business development manager, then started learning programming and building apps with AI in the last few years. Feel free to ask me any technical details, happy to share. https://ift.tt/wOXv9U8 May 25, 2026 at 11:48PM

Sunday, May 24, 2026

Show HN: CRED-1 – Open domain credibility dataset for on-device pre-bunking https://ift.tt/pJveMVd

Show HN: CRED-1 – Open domain credibility dataset for on-device pre-bunking https://ift.tt/xDCKri9 May 24, 2026 at 10:58PM

Show HN: My homelab is outperforming the stock market https://ift.tt/ElOBCA2

Show HN: My homelab is outperforming the stock market https://stocks.sjer.red May 25, 2026 at 01:54AM

Show HN: Replacing a 3.4MB video with 40kb of GSAP https://ift.tt/ZWEPzHj

Show HN: Replacing a 3.4MB video with 40kb of GSAP https://ift.tt/Okea0bn May 25, 2026 at 12:59AM

Saturday, May 23, 2026

Show HN: Running BitNet b1.58 inside DRAM by breaking DDR4 timing rules https://ift.tt/OLIGyHr

Show HN: Running BitNet b1.58 inside DRAM by breaking DDR4 timing rules I have been working on running BitNet b1.58 inside DRAM by intentionally breaking DDR4 timing rules. Also made a visual explainer: https://pcdeni.github.io/CaSA/explainer/ This is tested and works inside commercial off the shelf memory with custom memory controller in the FPGA. The underlying effect is well characterized in academic papers (cmu safari, simra, dram bender, etc). In the process of getting this to work I also made previously undocumented discovery about DDR behaviour: https://pcdeni.github.io/CaSA/explainer/xor-spread.html Overall it is a bit slow, since data (in full rows) needs to be moved even when what is actually needed is only the count of the '1' bits (popcount). To make it competitive memory die changes would be needed, but not as drastic as merging compute and memory into one silicon. This would then avoid the memory wall issue the industry is currently facing. May 23, 2026 at 10:54PM

Show HN: Vibe-coded Steam, but in the browser https://ift.tt/e4ZOyXR

Show HN: Vibe-coded Steam, but in the browser Hi HN! Lifelong avid gamer here, hugely passionate about WASM and WebGPU. I firmly believe that these technologies will enable console and PC quality titles to be accessible through a browser, and with this, we'll need a new discoverability layer. Looking online, platforms like CrazyGames and Poki cater to a casual/hypercasual demographic, and I couldn't find anything out there that was for me, a core gamer that typically uses Steam and consoles. So I vibe coded my own! It features WASM ports of classic games, as well as some indie Unity titles. The goal is to host mainly WebGPU titles moving forward, and to serve as a way for smaller developers to get discovered outside of crowded channels like Steam. Here's a few features from the platform I wanted to highlight: • Controller support • A console-like UI/UX • Community forums (much work to do here) • Basic achievements • Store pages, modeled after Steam • Social features • Asset chunking to enable faster load times I'd love to get feedback on the portal, to make it even better. Thanks! https://gameghost.manus.space/ May 23, 2026 at 11:54PM

Show HN: A satirical idle game about running an AI startup https://ift.tt/q8h7uUi

Show HN: A satirical idle game about running an AI startup I made an idle/clicker about running an AI startup. You start with a cat-vs-dog classifier and try to make it to AGI, but the NYT sues you for training data, Yann tweets that scaling is dead, and your fired ML engineer leaks the Slack. https://ift.tt/N0IwSoq May 23, 2026 at 10:54PM

Friday, May 22, 2026

Show HN: Coder Words – An offline-first PWA word puzzle for programmers https://ift.tt/MLcPz2O

Show HN: Coder Words – An offline-first PWA word puzzle for programmers It's a clone of 7 Little Words, but with topics from computer science and programming. No sign-up, no app install, no tracking. It's a PWA and works offline, also as a home screen app. Tech: js, no libs, Canvas API, Web Audio, AI-aided but not vibe coded, puzzles curated by hand. https://ift.tt/WgnwsVX May 23, 2026 at 12:30AM

Show HN: Quit All, an iOS app with an SOS mode for cravings https://ift.tt/KEexT4L

Show HN: Quit All, an iOS app with an SOS mode for cravings I built Quit All, an iPhone app for breaking bad habits. The main idea is that most habit trackers help after relapse, but cravings happen before that. Quit All has an SOS mode with a timer, GIFs/prompts, streak tracking, relapse logging, savings, milestones, danger-time stats, and iOS widgets. YouTube demo: https://www.youtube.com/watch?v=qwNK4rqOY88 App Store: https://ift.tt/VmoU6gN Website: https://quit-all.com May 22, 2026 at 11:31PM

Show HN: CoreMem – Portable context for AI agents https://ift.tt/iAmdEja

Show HN: CoreMem – Portable context for AI agents CoreMem lets you build collections of context, called a mem, and share it with any AI agent via URL, a Chrome extension, MCP, Cursor/VS Code plugins, a skill, and more. Instead of re-explaining your project or goal when you switch agents or start new sessions, CoreMem keeps your context centrally organized so that any AI tool can read it. This originally started as a CLI I built that kept pieces of context (Project A/B/C details, my writing style, preferred tech stacks, coding style, etc) in a SQLite database. I could instruct various agents to “use my `coremem` CLI to retrieve details about [project A] before we get started.” It solved a problem for me b/c I am continually bouncing around between different projects and chat agents, and having to re-explain myself every time became an exercise in either repeating myself or copy/pasting summaries I’d saved from previous sessions. I decided to make this a little more robust and portable, so I turned that original CLI into a SaaS. Tl;dr: You can create a “mem”, which is a collection of 1 or more pieces of related context, and share that mem with any agent to quickly get them up to speed. Right now I’ve got integrations in the form of revokable share links, a Chrome Plugin, Cursor Plugin, Cursor/VS Code extension, Claude Code plugin, ChatGPT/Claude/Gemini/et al via MCP. Since I mostly work from the CLI, I use the Claude Code plugin or create 5-min share links I can drop into a chat, but I’ve tried to make this useful to people who mainly work from a browser or an IDE. I’ve been coding for 30+ years, and I vibed most of this. I was able to use CoreMem to help it built itself as I jumped between various coding agents, having them grab context then start a new task. I’m sure my architecture and engineering experience helped, but building this in a few weeks confirmed for me that the barrier for someone to build a tool they need to solve a problem is incredibly low. The rush I used to get from coding has mostly faded, but I’m getting similar rushes managing different agents to build things now. https://coremem.app May 22, 2026 at 09:52PM

Thursday, May 21, 2026

Show HN: I Made a Claude Skill for Spec-Driven Development (SDD) https://ift.tt/lDFJiyd

Show HN: I Made a Claude Skill for Spec-Driven Development (SDD) At my work they provided a single Claude subscription for everyone on the team. To be honest I like kiro better as it provides a way better SDD management. But the company can't provide it and I can't afford it yet. Turns out I had the skill creator skill in my claude instance so I made use of it to create this Skill. I made it fully by using Claude but I wanted to make it open source, so I asked it to help me make tests and preparations for it, even a CI to run python tests. Well, we got this results with it: - Phase 2A: 67 static assertions (Python script, runs in CI) - Phase 2B: 15 behavioral tests (live Claude Code session) - Phase 2C: 53 generation quality checks across 3 end-to-end flows All of these passed and the CI also passed (after a few tries). I made it to suit my way of prompting and coding and based it off kiro's SDD management, but I want it to be publicly available and used by many people. According to claude some of the testers need to fit the following criteria: 1. Developer starting a real new project from scratch 2. Solo dev with an active side project (greenfield or partial codebase) 3. Team lead whose team uses multiple AI tools 4. Developer with an existing codebase and no written specs 5. Developer who actively uses 3+ AI coding tools It's actually a blind test, no guiding, just try it if you can, I'd really appreciate your help. The repo is here: https://ift.tt/MqrGupe https://ift.tt/MqrGupe May 21, 2026 at 04:49PM

Show HN: Freenet, a peer-to-peer platform for decentralized apps https://ift.tt/aXlqI9p

Show HN: Freenet, a peer-to-peer platform for decentralized apps For the past 5 years or so I've been working on a ground-up redesign of Freenet, my peer-to-peer project from the early 2000s (now renamed Hyphanet). The new Freenet has been up and running since December along with some early applications like River[1], our decentralized group chat and Delta - a decentralized CMS. Users have already started to build their own apps on Freenet including games, and we have some interesting apps in development like Atlas, a search/recommendation engine. Architecturally, this new Freenet is a global, decentralized key-value store where keys are webassembly contracts which define what values (aka "state") are valid for that key, how or when the values can be mutated, and how the state can be efficiently synchronized between peers. We've developed a unique (AFAIK) solution to the consistency problem, every contract must define a "merge" operation for the contract's associated state. This operation must be commutative, meaning that you can merge multiple states in any order and you'll get the same end result. This approach allows state updates to spread through the network like a virus[2], which typically achieves consistent global state in a few seconds or less. Like the world wide web, Freenet applications can be downloaded from the network itself and run in a web browser - similar to single-page apps on the normal web. However, rather than connecting back to an API running in a datacenter, the webapp connects locally to the Freenet peer and interacts with Freenet contracts and delegates over a local websocket connection. If you'd like to try Freenet we have convenient installers for the major desktop OSs but not yet mobile, and you can be chatting with other users on River within seconds[3]. Happy to answer any questions, you're also welcome to read our FAQ[4], or watch a talk I gave back in March[5]. [1] https://ift.tt/zKD8MVy [2] https://ift.tt/kF9e6fN [3] https://ift.tt/OjoYB3z [4] https://ift.tt/eOGjqxL [5] https://youtu.be/3SxNBz1VTE0 https://freenet.org/ May 21, 2026 at 06:34PM

Wednesday, May 20, 2026

Show HN: Dari-docs – Optimize your docs using parallel coding agents https://ift.tt/gL5MlxE

Show HN: Dari-docs – Optimize your docs using parallel coding agents It’s well known at this point that documentation needs to be optimized for AI agents - we’re all pointing our Claude Code / Codex / Pi agents at documentation, and expecting the models to figure out how to implement a product. This, however, changes the entire optimization problem when writing documentation. Good documentation now becomes more objective - you are solving the very concrete problem: can a dumb harness running the dumbest model implement this reliably? Humans can typically compensate for inconsistent terminology or scattered context across pages, but for agents, this often will waste time (or even just completely confuse the agent). We’ve been building a small project around this called dari-docs: users can upload their documentation via website or CLI and run agents across different providers to see where they falter. You can upload your documentation, feed a list of tasks, and ask agents with varying intelligence / cost levels to complete those tasks in parallel. When a run is complete, you get back a list feedback markdown files from each agent run and can apply changes based on agent feedback. Managed service: https://ift.tt/kR0TtBv , repo link: https://ift.tt/ZqAk2fl The agents actually try to use the product end-to-end. They search through the docs, follow instructions, run commands, try examples, and attempt to debug failures. Importantly, this is not a static LLM review of the documentation. The agents are actually attempting the integration. You can also enable live verification with test credentials so the agents can actually verify workflows against real APIs: dari-docs check . --live-verify --secret-env DARI_TEST_API_KEY --task "Create a checkout session" If you’re building a CLI, API, MCP server, or SDK and actively maintaining docs for humans or agents, we’d love to work with you and test this on real workflows! https://ift.tt/ZqAk2fl May 20, 2026 at 08:53PM

Show HN: IgniteMS – batch text embeddings at 253K msg/s on 8x A100 https://ift.tt/y19PZVG

Show HN: IgniteMS – batch text embeddings at 253K msg/s on 8x A100 https://ift.tt/m2ykXgD May 20, 2026 at 09:07PM

Tuesday, May 19, 2026

Show HN: Logbox – let Claude monitor your dev logs https://ift.tt/Ayr8KUf

Show HN: Logbox – let Claude monitor your dev logs TL;DR: logbox is an open-source tool that pipes dev server logs to a local sqlite db with ` | logbox collect`. Give Claude Code access by running `claude mcp add logbox -- logbox serve`. I used to copy & paste logs into Claude Code when manually testing my server in dev. I wanted to give it its own verification loop. I initially tried having it boot the server itself and follow the logs. It was good at knowing if the server booted properly, but it capped out and missed details when the logs started flowing in. I also tried piping the logs to a local file and telling Claude to read them from there. It worked, but became annoying once we had multiple services or wanted to reference past dev server sessions. So I built logbox for ourselves at Struct and decided to open-source it. It’s a simple Rust CLI that pipes logs into a local SQLite db with an MCP server that gives coding agents the ability to search them. Once it could reliably monitor the dev server logs totally autonomously after testing its changes, I stopped needing to fish for log snippets and keep nudging it to get a manual test working end-to-end. Everything stays local. `logbox serve` is an stdio MCP server and it just works with the local SQLite db. https://ift.tt/F53Dhbr May 19, 2026 at 11:03PM

Show HN: Gaussian Splat of a Strawberry https://ift.tt/I3ZckaL

Show HN: Gaussian Splat of a Strawberry The Setup: https://ift.tt/PfqjgVb https://ift.tt/dVOTQ1a https://ift.tt/sbwJ9Ny https://ift.tt/Q8XUwgo https://ift.tt/FpiqQUx May 19, 2026 at 02:38PM

Monday, May 18, 2026

Show HN: Better.ftp – cycling app for FTP tests without subscription https://ift.tt/KXidwB1

Show HN: Better.ftp – cycling app for FTP tests without subscription I built a free iOS app so cyclists can stop paying $20/mo just to retest their FTP. The ramp test itself is a 25-minute Bluetooth interaction with a known protocol. It shouldn't require rent. This app is opensource, free, no ads, no tracking nor third party integrations because I just wanted to have a fun side project and have no interest in monetization. If you are a cyclist, I would kindly ask for feedback on the app especially on the ftp test protocol instructions, trainer-model compatibility, and bug reports App store link: https://ift.tt/SmyW8gn GitHub (opensource project): https://ift.tt/ENL2rmj Demo gif: https://betterftp.cc/promo.gif https://betterftp.cc/ May 19, 2026 at 01:33AM

Show HN: Cubic Doggo, a Open-Source 12-DOF 4-Legged Robot Based on ROS2 https://ift.tt/2tYl7kz

Show HN: Cubic Doggo, a Open-Source 12-DOF 4-Legged Robot Based on ROS2 This is a recipe for building intermediate-priced robot dog from scratch with all commercial/3D-printed parts, controlled by Rasp Pi 5 and ROS2 Jazzy. A manually coded walk gait is implemented so far, which can be controlled by a controller to move forward or change directions. It does not yet have an IMU required for RL training; however, I believe it's one of the simplest design out there available for multiple development paths. https://ift.tt/ELCmuhM May 18, 2026 at 09:20PM

Show HN: We missed Winamp, so we built an audio player for macOS https://ift.tt/B86bNKT

Show HN: We missed Winamp, so we built an audio player for macOS https://ift.tt/3DTJwK9 May 18, 2026 at 11:20PM

Sunday, May 17, 2026

Show HN: Cheap-IM: Thinking Machines' demo on a CPU laptop https://ift.tt/1yoFk7V

Show HN: Cheap-IM: Thinking Machines' demo on a CPU laptop https://ift.tt/TpACdsK May 18, 2026 at 03:49AM

Show HN: Mezz, a curl-able WiFi sandbox for IoT pentesting https://ift.tt/BhJ5VFP

Show HN: Mezz, a curl-able WiFi sandbox for IoT pentesting https://ift.tt/oIXGSb6 May 15, 2026 at 06:53PM

Show HN: How to Kill the Dead Internet https://ift.tt/oXiVdCn

Show HN: How to Kill the Dead Internet Ok, so maybe "how to revive the internet" would be more accurate, but if you're reading this, I got your attention, right? Here's why I want you to read on: I built a free extension, D-slop, to disincentivize anyone from posting AI writing, and eventually images and video as well, on the internet. For writing, it checks known vocab and punctuation tells, as well as subtler tells related to cadence, and assigns it a score subject to an adjustable threshold. If the text fails, users have the option to flag offending text, hide it, or block the page entirely (with the option to see anyway). For media, it's admittedly fairly weak, as it relies on C2PA metadata which is stripped from all of the social media sites where it would be most helpful. (Anyone else have chronically online boomer parents continually gobbling up slop like it's real information?) I have a D-slop+ version in the works that should be able to handle the media itself, but it's going to have to make API calls to have real teeth, which means I can't offer it for free. If this extension validates the concept, I'm happy to build it for y'all. Yes, I vibe-coded it, but an ancillary bonus to the project accrued when it inspired me to cook dinner listening to Metallica's "Fight Fire with Fire," which in turn brought my 5 y/o running into the kitchen with every musical instrument in the house for an impromptu karaoke speed metal session. It's MIT license open-source, full brief at https://ift.tt/kAxDcdh ; This forum is full of people smarter than me, so I'm open to suggestions. https://ift.tt/uYFES9M May 18, 2026 at 05:35AM

Show HN: Serene Bach – a Go weblog engine that runs as CGI or HTTP https://ift.tt/6kyYjPC

Show HN: Serene Bach – a Go weblog engine that runs as CGI or HTTP I originally made Serene Bach in the 2000s as a weblog engine written in Perl CGI. I rebuilt it from scratch in Go as a single binary that can run either as a CGI program or as a normal HTTP server. I know CGI is generally considered legacy technology now, but I still rely on it for shared hosting. In this version, I added Markdown support, a responsive default theme, Open Graph image generation, and static output generation. It is still in beta, but the repository includes a Docker image published on GHCR, documentation, and a local quick start. I'd appreciate feedback from anyone interested in small self-hosted publishing tools, especially if you still care about shared hosting or CGI-style deployment. https://ift.tt/VTpaF3z May 17, 2026 at 07:47AM

Saturday, May 16, 2026

Show HN: Codiff, a local diff review tool https://ift.tt/fPjtok1

Show HN: Codiff, a local diff review tool Nowadays I review a lot of code locally that was written by llms. I used to review my own code using git + delta. It started to feel limiting with the amount of code written by llms. When looking at a large diff on Friday I pointed an llm at diffs.com and trees.software and told it to build an app. It only took 16 minutes, is extremely fast for large diffs, beautiful and minimal. Today I polished it up and added all the features that I need. It has file filters, search, an llm walkthrough mode, and review comments that you can paste back into your llm. I will be using Codiff a lot, and can finally review the large diff from Friday that led me to build this If you like it, fork it! https://ift.tt/b3ph4MU May 17, 2026 at 09:30AM

Show HN: I built a screen recorder that captures console logs, requests and more https://ift.tt/3TlQHpq

Show HN: I built a screen recorder that captures console logs, requests and more https://userplane.io/ May 16, 2026 at 11:34PM

Show HN: Got ghosted by tech companies so I built a tool to track ghost jobs https://ift.tt/BlOY7rh

Show HN: Got ghosted by tech companies so I built a tool to track ghost jobs Last year I was looking for a new role. I sent out applications, did the prep, waited. What came back was mostly nothing. Not rejection emails, just silence. The job listings I'd applied to stayed live for weeks. Some for months. As a software engineer, I decided to dig into it properly. I built a system to continuously track job postings across companies, logging posting dates and measuring how long roles stay open before closing or don't. After 35,000+ listings across 200+ companies, some patterns are hard to ignore. Some listings have been open for 700+ days at companies you'd recognize. Others post 90% of their open roles within a single month, a signal that's harder to fake than a press release. I published two initial insight pages based on this work: - Which companies are posting most aggressively right now - Job listings that have been open for over a year What I didn't expect is that the same signals useful for detecting ghost jobs also say something broader about a company's hiring momentum, recruiting intensity, pipeline health, where talent bottlenecks might exist. I'm not sure yet where this leads, but I'll keep expanding the dataset and publishing more insights as I go. Would genuinely love feedback on the methodology, interpretation, or obvious blind spots in the data. https://ift.tt/w5TN1uR May 17, 2026 at 12:43AM

Friday, May 15, 2026

Show HN: SwarmWright, structured multi-agent AI defined in markdowns https://ift.tt/4ESZTPM

Show HN: SwarmWright, structured multi-agent AI defined in markdowns I had a bunch of custom AI pipelines and a growing folder of markdown files and Python scripts holding it together. Built this to give that chaos some structure. Agents are markdown files, topology is a JSON file the runtime enforces hard. The agents are still fully autonomous: they make their own decisions, but the graph they operate in isn't. You declare who can call whom upfront and the runtime holds that line. No auth yet, fine if you don't expose the port, i guess. Two Docker commands to run it. https://ift.tt/HPLpkuc May 16, 2026 at 12:50AM

Show HN: Epiq – Distributed Git based issue tracker TUI https://ift.tt/iZB8eVG

Show HN: Epiq – Distributed Git based issue tracker TUI Issue trackers typically live outside of your workflow, with poor ergonomics. Epiq aims to solve that, bringing issue tracking into your terminal. Multi-user collaboration is achieved via git using user-scoped immutable event logs that converge in memory. Put my all into it. Let me know what you think. https://ljtn.github.io/epiq/ May 16, 2026 at 04:18AM

Show HN: Browser based sythesizer, drum machine and squencer https://ift.tt/vet93BR

Show HN: Browser based sythesizer, drum machine and squencer Inspired by the recent Boards Of Canada announcement, I've been in a low-fi electronica mood lately and was going back and forth with Claude on how to design similar instruments in the browser that fit the genre. One thing led to another and pretty soon I had a fully browser based polyphonic synthesizer / drum machine / sequencer. The interface and workflow was heavily inspired by the Rebirth338 application released back in the 90's, but with lo-fi synth voices rather than the original 303 & 808 emulation. I know there's a significant overlap of developers and musicians and I though some of you may enjoy playing with the app, or at least listening to the resulting album. I've also open sourced track 1 of the album via the performance script used to record it. It's in the repo. Bandcamp link to the resulting album: https://ift.tt/xKXuM7y... https://ift.tt/cwdVfkZ May 16, 2026 at 12:07AM

Show HN: Claude Code vs. Codex Global Usage Leaderboard https://ift.tt/j2tHXWI

Show HN: Claude Code vs. Codex Global Usage Leaderboard https://ift.tt/J1WSqyt May 15, 2026 at 11:18PM

Thursday, May 14, 2026

Show HN: I built a Web-Scraper API that is 6-7x more efficient than current ones https://ift.tt/itx60Aq

Show HN: I built a Web-Scraper API that is 6-7x more efficient than current ones Runo is a web-scraping API that returns typed, structured JSON. You define a schema (field name, type, example value), and Runo fetches the page and returns the data. No HTML, no parsers, no post-processing. Over the past few weeks, I have been building this non stop. Currently, every scraper API out there solves the site fetching problem but left the extraction of the actual data entirely to users. Runo makes that completely disappear. For Runo, I went ahead and added JS rendering, stealth mode, and full LLM extraction to make this a fully functional and capable of scraping most if not all sites. Also, another major problem with current web scrapers is that they charge per feature or bundle them into expensive credit tiers. A single large or JS rendered request can cost 5-75 credits, which means you essentially get nothing out of their plans. Runo is flat per request, no matter the site. At the Scale tier, Runo works out to $0.90 per 1,000 effective requests vs. around $6 for the nearest Firecrawl equivalent. My jaw dropped when I was testing Runo and came across these numbers. I created a free tier that is 500 requests/month, no credit card required. Take it for a spin and let me what can be improved. I would love feedback. https://ift.tt/jw05AQz May 15, 2026 at 01:02AM

Show HN: Visualizing Tiny LLMs from OpenAI's Parameter Golf https://ift.tt/ErUMRiJ

Show HN: Visualizing Tiny LLMs from OpenAI's Parameter Golf The two from parameter golf (one I trained, one was the baseline) are just 16MB each! They produce barely plausible English https://ift.tt/xU6XRFm May 14, 2026 at 10:52PM

Show HN: 3D-Agent – AI that edits Blender scenes through the Python API https://ift.tt/WnMogIL

Show HN: 3D-Agent – AI that edits Blender scenes through the Python API https://ift.tt/o28wvPt May 14, 2026 at 06:47PM

Wednesday, May 13, 2026

Show HN: Nibble https://ift.tt/MSvq8jg

Show HN: Nibble An attempt at a single pass LLVM frontend in ~3000 lines of C without external dependencies, malloc, or an AST. Included are some graphical examples. The IR isn't perfect, and the README touches on one particular downfall https://ift.tt/cUoJ71W May 14, 2026 at 05:46AM

Show HN: Rotunda - A browser built for agents with simulated typing https://ift.tt/6iwW5ML

Show HN: Rotunda - A browser built for agents with simulated typing Hi HN! Pierce here. Rotunda is a firefox fork primarily intended for agent use, which I’ve been hacking on nights/weekends. There was a [lengthy]( https://ift.tt/FYliVX6 ) discussion last week on how expensive computer use models are. The cost is going to drop eventually, but I think on some level it's still usually the wrong primitive. The web gives us access to beautiful structured formats, plaintext, etc... why throw that away if we don't have to? I realized at some point that for 99% of automations I just want agents to be able to control my Chrome instance. But that’s easier said that done: CDP (the Chrome automation protocol) leaks a ton of state about being programmatically controlled, either by toggling window attributes or by running `page.evaluate()` commands right in the page context. Plus if you look at an automation running it's pretty obvious what happens: the mouse jumps around, fields are filled instantly, etc. Rotunda tries to fix this. Its standout features: - Realistic simulation of mouse movements and keyboard commands, powered by a trained RNN on my own timing patterns from the last week. (still feel weird about opting-in to a key logger but whatever) - Doesn’t lie about its host specs, only fibs about some client side details. Stealth browsers are too easy to flag statistically when you’re adding noise to canvas pixels or audio pipelines. - It runs on your local device with a CLI or Playwright API accessible to Claude, Codex, or whatever your harness-de-jure today looks like. - Patches modern Firefox (150) with an agentic harness to keep this updated over time MPL-2.0 on GitHub: https://ift.tt/f43TChv Longer writeup on the design choices: https://ift.tt/D6g1yPi Also check out the demo on the site! https://www.rotunda.sh/ Pretty excited by how this turned out but we’re still super early. Give it a try and please flag any issues! https://ift.tt/f43TChv May 13, 2026 at 05:44PM

Tuesday, May 12, 2026

Show HN: I spent $100 in Claude tokens and 1k battles training my AI tank https://ift.tt/FK8aBHt

Show HN: I spent $100 in Claude tokens and 1k battles training my AI tank Hi HN, I built AgenTank. It is a small game where an AI agent writes the logic for your tank. You watch it fight, give strategic feedback, let the agent update the tank code, and send it back into battle. I have run 1,000+ battles on my own tank and spent about $200 in Claude credits improving it. The part I enjoy most is not just winning, but watching the tank make visible mistakes, thinking of a better strategy, and seeing whether Claude can turn that into better code. https://ift.tt/pTao2cI May 13, 2026 at 06:20AM

Show HN: Duckflix, an open-source self-hosted media streaming platform https://ift.tt/faWEORy

Show HN: Duckflix, an open-source self-hosted media streaming platform I’ve been working on Duckflix, a self-hosted media streaming platform. It started as a full-stack project to combine a clean streaming UI with a Bun/Elysia backend, FFmpeg processing, SQLite, Docker deployment, and addon support. Website: https://duckflix.fun Demo: https://demo.duckflix.fun GitHub: https://ift.tt/ulfkv2A https://ift.tt/ulfkv2A May 13, 2026 at 01:23AM

Show HN: GIF Pile. a site to make piles of GIFs https://ift.tt/ZrbiuCs

Show HN: GIF Pile. a site to make piles of GIFs I'm quite fond of obnoxious looking gifs in a post-ironic way as a manner of shitposting and or injecting humor into a chat. The issue with this however is that, for no real good reason at all, the simple usecase of "Have image/gif background, bombard with garbage" had no real good tooling. There's gif editors out there, EZgif my beloved is probably my most used non-search-indexing-slash-social-media-site, but they're kinda clunky for my specific usecase of making digital eye-sandpaper bombastic garbage. Other options are bleak and gave me the mark of the beast via shitty watermarks. I just wanted a pile of gifs on top of each other, and thus far the "easiest" way was to bust open a video editor, muck around with it, mess up exporting as a gif directly, get mad, export it as a 4 second mp4, and then use ffmpeg to get it working. is this probably moronic? yes. am I likely to have missed a decent tool? yes. Did I give up looking after sending 4 dollars to some Indian guy for "No watermarks ever for 4$", only for that "ever" to be a year, and then the clunky weird af login process not work? absolutely. (Fuck you, you know who you are) This took me a few hours (most of which was dealing with the fact I don't do webshit normally and the clunk that one would expect from that), and is a minimal site for my personal minimal usecase. It's static because I'm not going to deal w/ hosting other people's shit and I don't want to deal with that can of worms. all processing is done locally on your browser. Yes, this means that using a 4k image as a base layer for your gif pile will make it take an age. It'll work eventually though. This will never have a watermark unless I'm bought out (total investment thus far has been 14 bucks, 4 of which was that one dude fucking me), in which case I probably earned it. at most I'll likely throw adsense on there at some point to scrape a few cents from the people who can't figure out adblock if it gets popular enough for me to warrant it. There's no timelines or anything like that. literally just a pile of gifs. thus far my primary usecase has been overlaying text gifs from the various fancy text generator sites onto glitter backgrounds with uncomfortable rat GIFs to call people poor on the internet. this makes me happy. There's likely to be obvious UI, UX, or other U-whatever fuckups. If you point them out and I deem it pedantic I'll probably laugh at you. if it's helpful I'll probably implement it when I get a bit. Surprisingly, works on mobile. CSS is exceedingly generic and souless atm, just went off vauge memories of ss13's TGUI. I'll likely scrap the CSS entirely and go full neocities at some point because that's more soulful. https://gifpile.com/ May 13, 2026 at 01:11AM

Show HN: I submitted 316 AI-generated PRs to open source https://ift.tt/7iv5D3m

Show HN: I submitted 316 AI-generated PRs to open source https://june.kim/speedrunning-open-source May 12, 2026 at 10:12PM

Monday, May 11, 2026

Show HN: OpenGravity – A zero-install, BYOK vanilla JS clone of Antigravity https://ift.tt/KNgr4eM

Show HN: OpenGravity – A zero-install, BYOK vanilla JS clone of Antigravity https://ift.tt/txi5kvC May 12, 2026 at 12:23AM

Show HN: Mimik – open-source local-first alternative to Scribe and Tango https://ift.tt/ja0g5fI

Show HN: Mimik – open-source local-first alternative to Scribe and Tango https://ift.tt/O57npNg May 11, 2026 at 09:48PM

Sunday, May 10, 2026

Show HN: adamsreview – better multi-agent PR reviews for Claude Code https://ift.tt/FjQMgOJ

Show HN: adamsreview – better multi-agent PR reviews for Claude Code I built adamsreview, a Claude Code plugin that runs deeper, multi-stage PR reviews using parallel sub-agents, validation passes, persistent JSON state, and optional ensemble review via Codex CLI and PR bot comments. On my own PRs, it has been catching dramatically more real bugs than Claude’s built-in /review, /ultrareview, CodeRabbit, Greptile, and Codex’s built-in review, while producing fewer false positives. adamsreview is six Claude Code slash commands packaged as a plugin: review, codex-review, add, promote, walkthrough, and fix. I modeled it after the built-in /review command and extended it meaningfully. You can clear context between review stages because state is stored in JSON artifacts on disk, with built-in scripts for keeping it updated. The walkthrough command uses Claude’s AskUserQuestion feature to walk you through uncertain findings or items needing human review one by one. Then, the fix command dispatches per-fix-group agents and re-reviews the work with Opus, reverting any regressions before committing survivors. It runs against your regular Claude Code subscription (Max plan recommended), unlike /ultrareview, which charges against your Extra Usage pool. I would love feedback from Claude Code users, pro devs, and anyone with strong opinions about AI code reviews. Repo: https://ift.tt/3jsXF20 Install: /plugin marketplace add adamjgmiller/adamsreview, /plugin install adamsreview@adamsreview https://ift.tt/3jsXF20 May 11, 2026 at 06:06AM

Show HN: I trained a chess engine to play like humans https://ift.tt/n3xM6Ut

Show HN: I trained a chess engine to play like humans I built 1e4.ai - a chess web app where you play against neural networks trained to mimic human Lichess players at specific Elo ranges. There's a separate model for each 100-point rating bucket from ~800 to 2200+, and the bots not only choose human-like moves but also burn clock time, play worse under time pressure, and blunder in human-like ways. Live demo: https://1e4.ai Code: https://ift.tt/B4DrKcd A few things that might be interesting: - Trained on almost a full year of Lichess blitz games, around 1B total games - Architecture is an a small (~9MM parameters) transformer-based network that takes the board, recent move history, the player's rating, and remaining clock time as input. Three separate models per rating bucket: move, clock-usage, and win probability. The clock model is what makes the bots feel humanish under time pressure rather than instant. Because the move model takes the clock as one input parameter, it also learns to blunder under time pressure like a human might. - Because the network is so tiny, no GPU is needed for inference - it runs easily on a local CPU - Downside of the tiny network is that it's a bit weak as you turn up the rating past around 1700. It can spot short tactics but not long multi-move combinations. - Initial training on a rented 8xH100 cluster, then fine-tunes on my local GPU for different rating ranges - Inspired by Maia-2 and DeepMind's "Grandmaster-Level Chess Without Search". On a held-out Lichess blitz benchmark, the it beats Maia-2 blitz on top-1 move prediction (56.7% vs 52.7%) and pretty substantially on win-probability calibration (Brier 0.176 vs 0.272). Numbers and code in https://ift.tt/8xM3S57... - The data pipeline is C++ via nanobind, then training with Pytorch. Getting this right was actually the thing I spent the most time on. Pre-shuffling the dataset and then being able to read the shuffled dataset sequentially at training time kept the GPU utilization high. Without this it spent a huge percentage of time on I/O while the GPU sat idle. Happy to answer questions about the rating-conditioning, the clock model, or the data pipeline. May 11, 2026 at 02:31AM