Compiling LLMs into a MegaKernel: A path to low-latency inference

https://miro.medium.com/v2/resize:fit:841/1*xW52lnX0OeobpeA2_BvX1w.png
Researchers developed a compiler called Mirage Persistent Kernel (MPK) that automatically transforms large language model (LLM) inference into a high-performance megakernel, reducing latency by 1.2-6.7x. MPK's end-to-end GPU fusion approach eliminates kernel launch overhead and maximally overlaps computation, data loading, and inter-GPU communication across layers.

Show HN: I wrote a new BitTorrent tracker in Elixir

https://repository-images.githubusercontent.com/968272878/e35de260-d3f9-411d-a9f3-bc9dc13346b6
The project is a Work In Progress with a testing instance at extracker.dahrkael.net:6969, requiring Erlang and Elixir. It can be run with a docker image or custom release files.

Literate programming tool for any language

https://opengraph.githubassets.com/a9406fa6fbe556a5025c9544f574a93ee2fd9a41e35de74d3430dcf297279b35/zyedidia/Literate
Literate programming is a style where code is written for humans to read and understand, with explanations and code snippets. It's a tool that simplifies Knuth's CWEB system and adds features, written in D programming language.

Curved-Crease Sculpture

https://erikdemaine.org/curved/Freedom/0731-039_square_400.jpg
Researchers explore self-folding origami with curved creases for deployable structures and manufacturing. Curved origami sculpture has a surprising history dating back to the 1920s at the Bauhaus.

Andrej Karpathy: Software in the era of AI [video]

Show HN: RM2000 Tape Recorder, an audio sampler for macOS

https://rm2000.app/_astro/Mac_Universal_logo.BqK_kNrl.png
Your Mac plays sound all day - it just doesn't make it easy to keep any of it. Recording system audio is still very clunky: audio routing hacks, third-party tools that aren't meant for the task - too many steps. RM2000 Tape Recorder just listens. Whatever your Mac is playing, it can record it instantly - then, lets you file, tag, and organize it.

Homegrown Closures for Uxn

https://krzysckh.org/res/hearts.gif
The user created niënor, a lispy environment for uxn, with a compiler and macroexpander that generates uxn roms from scheme-like code. They implemented closures by adding environment parameters and generating portal functions to bind variables at runtime.

Show HN: EnrichMCP – A Python ORM for Agents

https://opengraph.githubassets.com/fa6a37a6f7c2c290dc77ce0f05d70d9116fe46bf16cb156f4ccc2f33825d404f/featureform/enrichmcp
EnrichMCP is a Python framework that turns data models into a semantic MCP layer for AI agents to understand and navigate. It adds a typed, discoverable layer to data models, enabling AI agents to work with data as naturally as developers using an ORM.

Show HN: A DOS-like hobby OS written in Rust and x86 assembly

https://opengraph.githubassets.com/29a94f95f31006d1281d04aef3e125a9bf2c5977a89b60b48caf48def3210aed/krustowski/rou2exOS
To run the OS, use the attached ISO image in QEMU emulator or boot from a USB flash disk on x86_64 baremetal. Install dependencies and compile the kernel and bootloader to create an ISO image.

How OpenElections uses LLMs

https://thescoop.org/archives/2025/06/09/how-openelections-uses-llms/limestone.png
OpenElections uses Google's Gemini Large Language Model (LLM) to convert image PDFs of election results into CSV files, achieving high accuracy and speed. Gemini's ability to handle large PDFs and provide domain-specific logic makes it a valuable tool for parsing election results, but it can struggle with sustained processes and may require manual cleanup.

Extracting memorized pieces of books from open-weight language models

https://arxiv.org/static/browse/0.3.4/images/arxiv-logo-fb.png
Researchers studied large language models' memorization of copyrighted books, finding varying levels of memorization among different models and books. Results show that while some models like Llama 3.1 70B memorize entire books, others don't memorize most books, complicating copyright cases.

Guess I'm a Rationalist Now

https://149663533.v2.pressablecdn.com/wp-content/uploads/2021/10/cropped-Jacket.gif
The author attended LessOnline, a rationalist conference, and finally felt comfortable identifying as a Rationalist after years of hesitation due to concerns about the community's cult-like vibe and fear of sneers from others. The author now feels part of a community that shares his values and interests, and is excited to collaborate with them on projects like AI alignment and theoretical ...

String Interpolation in C++ Using Glaze Stencil/Mustache

Glaze provides string interpolation for C++ structs through stencil and mustache formats. It enables dynamic output generation by combining templates with structs.

Show HN: Claude Code Usage Monitor – real-time tracker to dodge usage cut-offs

https://raw.githubusercontent.com/Maciek-roboblog/Claude-Code-Usage-Monitor/main/doc/sc.png
This tool monitors Claude AI token usage, providing predictions and warnings about token consumption. It offers customizable settings for plan, reset hour, and timezone.

What would a Kubernetes 2.0 look like

https://matduggan.com/content/images/2025/06/image-2.png
The user discusses the evolution of Kubernetes from its early days as a Google project called Borg to its current status as a widely adopted container orchestration platform, highlighting its benefits and challenges. The user proposes several ideas to improve Kubernetes, including replacing YAML with HCL, improving etcd, replacing Helm with a new package manager called KubePkg, and defaulting to IPv6.

Star Quakes and Monster Shock Waves

https://caltech-prod.s3.amazonaws.com/main/images/Neutron-Star-Monster-ShockWa.2e16d0ba.fill-1600x810-c100.jpg
Scientists simulated a neutron star collision with a black hole, predicting flares and radio signals that telescopes might detect. The simulations, run on supercomputers, provide a deeper understanding of the universe's most energetic events.

Show HN: Unregistry – “docker push” directly to servers without a registry

https://opengraph.githubassets.com/485f95328cb8c364102a263c8c8f6d3f4cf3fd571c5e13cfa7af891dbd18364e/psviderski/unregistry
Unregistry is a lightweight container image registry that transfers missing image layers over SSH for efficient deployment. It eliminates the need for public code, private repos, or intermediate storage.

We Can Just Measure Things

The author spent 24 hours building a project with friends and agents, which led to frustration with Xcode and thoughts on how programming agents can measure code quality and developer experience. Agents can help evaluate code health, tooling, and ecosystem stability, providing objective measurements for improving developer experience.

Public/protected/private is an unnecessary feature

Access modifiers in OOP were originally used to protect implementation internals in Simula due to its heavy use of inheritance. However, they duplicate interface-defining features and are unnecessary if interfaces are used to hide implementation details.

Flowspace (YC S17) Is Hiring Software Engineers

https://app.jazz.co/img/apps/share_a_job-icon.jpg
This website uses cookies and other analytics technologies. By selecting "Allow", you consent to the recording, use and sharing of your website activity by this website and its service providers.

DNA floating in the air tracks wildlife, viruses, even drugs

https://www.sciencedaily.com/images/scidaily-icon.png
Researchers used air filters to capture environmental DNA in Dublin, revealing the presence of cannabis, poppy, and magic mushrooms. This method can track wildlife, humans, and illicit drugs, opening up possibilities for studying species without disturbing them.

Testing a Robust Netcode with Godot

https://studios.ptilouk.net/img/blog/meme_tcp_udp.jpg
The developer of Little Brats! faced challenges with online multiplayer, particularly synchronizing computers with latency while maintaining a fast-paced action game aspect. They used techniques like prediction and reconciliation to compensate for latency.

Visual History of the Latin Alphabet

https://uclab.fh-potsdam.de/arete/assets/images/help_en.png
The history of writing and printing spans from ancient libraries to modern digital fonts, with significant milestones in technology, culture, and society. Key developments include the invention of movable type, the Industrial Revolution, and the emergence of digital technologies like the World Wide Web.

Show HN: Tiny Hoare logic verifier using SMT

https://opengraph.githubassets.com/7882fd10b14f98cfa279fadff78b586dc5a361bcda95f8c367f4cde65ea68a12/namin/metaprogramming
Skip to content You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session. Dismiss alert You can’t perform that action at this time.

Posit floating point numbers: thin triangles and other tricks (2019)

http://marc-b-reynolds.github.io/assets/figures/posit/equalateral.png
Posits can be much more accurate than binary32. ieee-754 doesn’t specify a random rounding mode - although researchers have played with the concept 'rounding' in binary vs. posits is trivial to contrive an example where ieee would win but simple or convoluted examples boils down to measuring the size of ulps in the two formats doesn't tell us anything we shouldn’d already

Munich from a Hamburger's perspective

https://mertbulan.com/images/posts/munich/ludwig-and-artists.webp
The user visited Munich, Germany, and compared it to their hometown Hamburg, noting differences in history, culture, and lifestyle, with Munich feeling more wealthy and car-centric. While they enjoyed Munich's museums, natural spots, and food, they ultimately prefer Hamburg's more independent and trade-focused atmosphere and wouldn't want to live in Munich.

In-Memory C++ Leap in Blockchain Analysis

https://caudena.com/wp-content/uploads/2025/06/Cfd-cover.png
Caudena's CashflowD (CFD) is a proprietary analytics engine that processes blockchain data at scale, providing real-time insights and court-admissible evidence. CFD's in-memory database and JIT-compiling query engine deliver 200-400X lower infrastructure costs and unparalleled analytical power.

Getting Started Strudel

https://strudel.cc/icons/strudel_icon.png
Strudel is a JavaScript port of Tidal Cycles for making music with code, requiring no prior knowledge of JavaScript or Tidal Cycles. It's a flexible sequencer that can integrate into existing music setups via MIDI or OSC.

From LLM to AI Agent: What's the Real Journey Behind AI System Development?

https://www.codelink.io/_next/static/chunks/images/strapi-bucket-production/AI_System_Development_2500x1667_acc63958dc/AI_System_Development_2500x1667_acc63958dc_2048.webp
Large Language Models (LLMs) can be used for various tasks, but their limitations require additional capabilities for real-time information and decision-making. AI Agents offer greater autonomy, but simpler solutions like workflows and retrieval methods are often sufficient for many real-world use cases.

Why do we need DNSSEC?

https://howdnssec.works/public/images/oembed/ep1.png
DNS translates human-friendly names to IP addresses, acting like a phone book for devices. DNSSEC adds security by authenticating DNS responses, preventing tampering, but data remains non-confidential.