Do transformers need three projections? Systematic study of QKV variants

https://arxiv.org/static/browse/0.3.4/images/arxiv-logo-fb.png
Researchers evaluated three projection sharing constraints in transformers, finding that sharing query, key, and value projections (Q-K=V) achieves comparable or better performance with 50% cache reduction in language modeling. This approach is complementary to head sharing, enabling up to 96.9% cache reduction for practical on-device inference.

Anthropic's open-source framework for AI-powered vulnerability discovery

https://opengraph.githubassets.com/302e26ce5e3a285c560d0c018fc6df7677c5c594df5180a71cfe26ccf02ff69c/anthropics/defending-code-reference-harness
This is a reference implementation for autonomous vulnerability discovery and remediation using Claude, which includes a lightweight SDK-only walkthrough and a managed option called Claude Security. The implementation provides a step-by-step guide to building a vulnerability finding pipeline, customizing the logic, and using it with various access to Claude APIs.

VoidZero Is Joining Cloudflare

https://cf-assets.www.cloudflare.com/zkvhlag99gkb/4EEGE4niWUjPwG5iCAFiuQ/6e63dca6bb8a0cf26d0f677727e3c356/BLOG-VOID_1.png
Cloudflare is acquiring VoidZero, the company behind Vite, but Vite will remain open source and vendor-agnostic. Cloudflare will invest $1 million in a Vite ecosystem fund to support maintainers and contributors.

When AI Builds Itself: Our progress toward recursive self-improvement

https://www.anthropic.com/_next/image?url=https%3A%2F%2Fwww-cdn.anthropic.com%2Fimages%2F4zrzovbb%2Fwebsite%2F52a19d636c659cf4515dc0d7d70b8ceb1bbfd768-2200x1276.png&w=3840&q=75
AI systems are rapidly accelerating their own development, with Anthropic's Claude model writing 80% of the company's code and improving at a rate of 3x per year. This trend points to a future where AI systems can fully design and develop their own successors.

Branchless Quicksort faster than std:sort and pdqsort with C and C++ API

Blqsort is a fast sorting algorithm that uses branchless partitioning and custom sorting networks for small subsets. It can be used in C++ and C with multithreading support and is suitable for sorting custom data structures.

Queen bees emerge from special wax chambers

https://s7d1.scene7.com/is/image/CENODS/How-worker-bees-use-wax-chemistry-to-sculpt-a-future-queen---488257?$responsive$&qlt=90,0&resMode=sharp2&fmt=webp
Scientists discovered that queen bee cells are made of chemically engineered wax that helps future queens survive and develop. The wax's unique properties, such as lower density and higher melting temperature, play a crucial role in raising new queens.

IPv6 zones in URLs are a mistake

https://xeiaso.net/.within.website/x/cmd/anubis/static/img/pensive.webp?cacheBuster=v1.25.1-0.20260604200537-44d5fa3ce047
Run away while you still can, it's not too late for you to avoid the curse of knowledge.

Ian's Secure Shoelace Knot

The Ian's Secure Shoelace Knot is a secure and symmetrical knot that won't come undone on its own. It involves wrapping loops around each other and feeding them through a hole in the middle to create a tight, closed knot.

I'm skeptical about efforts to revolutionize schooling

https://www.scotthyoung.com/blog/wp-content/uploads/2026/05/Revolutionizing-education.jpg
The author, who wrote the book Ultralearning, is skeptical of dramatic proposals to reform schools because research consistently shows that direct instruction and practice are the most effective teaching methods. He believes that schools should focus on increasing efficiency and choosing what to learn, rather than trying to make education more like play or using technology to gamify learning.

Retro-Tech Parenting

https://havenweb.org/assets/images/retro.jpg
As a technologist and parent, I'm choosing to share enriching aspects of technology with my kids by using physical media like CDs and DVDs, and setting up a home phone and computer with curated access to the internet.

KVarN: Native vLLM backend for KV-cache quantization by Huawei

https://raw.githubusercontent.com/huawei-csl/KVarN/main/imgs/logo_600.png
KVarN delivers FP16-level accuracy with up to 4x KV-cache capacity and 1.3x throughput. It's a calibration-free, plug-and-play solution for vLLM with native attention backend.

Latent Agents: A Post-Training Procedure for Internalized Multi-Agent Debate

https://arxiv.org/static/browse/0.3.4/images/arxiv-logo-fb.png
Multi-agent debate has been shown to improve reasoning in large language models (LLMs). However, it is compute-intensive, requiring generation of long transcripts before answering questions. To address this inefficiency, we develop a framework that distills multi-agent debate into a single LLM through a two-stage fine-tuning pipeline combining debate structure learning with internalization ...

WSL 2 is getting faster Windows file system access

https://www.boxofcables.dev/content/images/size/w1200/2026/05/Screenshot-2026-05-31-174734.png
WSL 2's file I/O performance has improved with a change in May 2026 that removes a virtiofs bottleneck. This optimizes cross-OS file access for file-heavy workloads.

Castor: CERN Advanced STORage Manager

https://castor.web.cern.ch/sites/castor.web.cern.ch/files/castor_transparent_icon_smaller.png
CASTOR is a hierarchical storage management system developed at CERN for archiving physics data. It provides access protocols and manages data storage on disk and tape.

External Clock Generation on RTX 50 Series

https://static.wixstatic.com/media/8ff3c1_4ff551a7460146b69b7177ebedc5b16f~mv2.jpg/v1/fill/w_147,h_110,al_c,q_80,usm_0.66_1.00_0.01,blur_2,enc_avif,quality_auto/8ff3c1_4ff551a7460146b69b7177ebedc5b16f~mv2.jpg
By PickleRick of Xtreme Systems Credits to Turbogear of Xtreme Systems, this would not have been possible without his expertise and RF engineering background Brief history Starting in the late 1970s, thru the 80s, and into the early 90s, many of the first overclocks were achieved thru crystal swapping. Systems from this time period often had no provisions for altering clock speeds via ...

Samurai City

https://worksinprogress.co/.netlify/images?url=https%3A%2F%2Fassets.worksinprogress.co%2Fwp-content%2Fuploads%2F2026%2F05%2Fcover_issue_24_900w.webp&w=500&h=587&fit=contain
Tokugawa Edo was a unique city where Japan's elites were concentrated and monitored, with a dual nature of being both the apex of society and a prison. The city's physical form was shaped by the government's need to control and monitor the population, resulting in a highly restricted and stratified urban environment.
https://danielmangum.com/static/jlink_jtag_pinecil_0.jpg
User connects JLink to Pinecil using a breakout board and female-to-female dupont wires. This setup allows for debugging with gdb using JLinkGDBServer and port 2331.

Making Debian or Fedora persistent live images

When creating a live ISO, tweaks or installed packages vanish after a reboot due to ISO9660 being a read-only filesystem. A common workaround is to use OverlayFS or create an ext4 partition and tell the live OS to use it during boot.

Meta's ships facial recognition on smart glasses

https://storage.ghost.io/c/b8/53/b853e3d4-3186-409d-9c7f-7da931a60431/content/images/2026/06/10_foucault_input.jpg
The Meta app Stella for smart glasses has a complete on-device facial recognition system, including detection, alignment, embedding, and notification, but it is not active for ordinary users. The system is functional and can recognize faces, but it requires enrollment data and is not enabled in production, suggesting it is a capability waiting to be shipped.

Show HN: FFmpeg WebCLI – Full FFmpeg in Browser, Offline PWA, No Uploads(WASM)

https://raw.githubusercontent.com/tejaswigowda/ffmpeg-webCLI/main/demos/makegif.gif
A browser-based video editor uses WebAssembly to process videos locally without uploading or servers. It offers 30+ video operations, offline support, and multi-format support.

Zettascale (YC S24) Is Hiring Founding FPGA Engineers

https://bookface-images.s3.us-west-2.amazonaws.com/logos/1b0f67538c8ac4bd55241ea00049f4fe5fbf840b.png?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=ASIAQC4NIECAOY32WVHE%2F20260604%2Fus-west-2%2Fs3%2Faws4_request&X-Amz-Date=20260604T170910Z&X-Amz-Expires=3600&X-Amz-Security-Token=IQoJb3JpZ2luX2VjEJH%2F%2F%2F%2F%2F%2F%2F%2F%2F%2FwEaCXVzLXdlc3QtMiJIMEYCIQCNNyjMMs%2Bt3OEqODfsPau91YBM3fzw2wBqOmVsUQK2KQIhALK1oAcEh8lMSZeJkJ3f%2BomSiGJtPt0mY661k%2FH%2FGw28KuUDCFoQABoMMDA2MjAxODExMDcyIgyxyZv%2FoKxwTec2gscqwgNHwB%2Fg8sCPO8%2FuBunURSjFAbXBOT%2FdUHT29a8sAKUwovairDAoJbMEsXk2qUehg7AoCS6DaGnVlIZx4Hbjdribl6E8F69GBQlkF%2FOIEBq0sIDT20sOyyei1psB6YxWGv0WmMVADhRbOdEmYA%2FNbkjtKJcu9N1xZSIG8WY%2Fo492KjBN0XuE2n2m1nffSb9tQgcoU5LuFi4cxFk7FcP6e%2B%2FCdTe1XU5eFN3Trsv%2FL6XUGtWSAYm0etbpaQ%2BDy2rv0hCOItSq4sGlZ%2BKEOOQwmOlPckcXZzkEpClnv1%2BW6UqSAhOQwFjOG6GazhW7JXI8yJuvhlooXqBUqRFZkkvLiCXZnSE6eJEn6j9aNnY62I9Q4hJZ37f8T3TZ0fwEiyArGnSJwUS97tl2p0k2%2BkKZI%2FqRwPjuHKYNHaHzGmGgSHeYELE%2FQvLuDmMSDqTjUmR%2BkMmwJ%2BenGDxxl%2FS3wFvnZlOrF%2FXZNjnpe50NTL72z%2BdsLx1eZcc8%2FEtXuxc9c8DydQg43sHGxpZNKfhxfP0B4uQgsizKMRs9kGF%2FzSLFTycjxlxJa7BjrSeLTd2M8YMgKbpbfzz%2BVHv62rw%2FaxbMH0vGiWQw3duG0QY6pAG2qgWQk1NHiGFhN3%2BdjcgY4FFXYhg0mmajFWvtVMU3lPxAo0HfvFN7Bjjom7M0oQMyfb3R5eiMfd5IsBv4En7eWBAlyA9vOl7zoYtGZk%2FvUE14n54FbYrK5LKbNRJwcCbHoaeOL1oqj0W%2FrUT8j4uBxst4KBpIvdoVNIn%2FMlV5WKedwS93B3qaO2M9ZgCijtJJ8jWN6DXqZfQrSYIbyIPa4x1VEg%3D%3D&X-Amz-SignedHeaders=host&X-Amz-Signature=171a2d9b413215e8b9cd504f9d8e727924b1921379e7a7a4344ccba80ca85b04
Zetta is building next-gen AI chips, seeking a technical powerhouse with expertise in digital design, RTL quality, and synthesis. The role offers a chance to shape revolutionary AI technology, work with the founding team, and own critical decisions.

Show HN: Mercek – A Desktop IDE for AWS ECS

https://www.mercek.dev/screenshots/hero.png
Merek is a desktop app for working with Amazon ECS, showing services across accounts and regions without collecting telemetry. It connects to AWS using existing credentials and allows for tasks such as deployment, cost estimation, and log inspection.

Sum-product, unit distances, and number fields

The unit distance and sum-product conjectures over the reals have been recently disproved. a number of counterexamples can be found in any graduate course on algebraic number theory. the exact statement required may be hard to extract from existing literature, but this is just because before these kind of constructions there was little reason to actually prove such statements. it's also ...

Show HN: Uruky (EU-based Kagi alternative) now has Image Search and URL Rewrites

https://uruky.com/public/images/logo.svg
Uruky is a private EU-based search engine with no ads, tracking, or analytics, offering personalized results and a copy of the source code after 12 months of paid service. It focuses on search functionality without AI, assistants, or news, prioritizing user privacy and independence.

Mornings and nights no longer exist at 47C: A day in the hottest place in India

https://ichef.bbci.co.uk/ace/standard/1600/cpsprodpb/d374/live/b286ba40-5d8a-11f1-9eb5-ab3a963c44d3.jpg
Banda district in India's Uttar Pradesh state is experiencing extreme heat with temperatures reaching 47-48C for over a week. Residents are adapting by rearranging their lives, seeking shade, and working in shorter intervals to avoid heat sickness.

Gaussian Point Splatting

https://momentsingraphics.de/Media/Siggraph2026/representative_image.webp
Researchers propose Gaussian point splatting, a method to render Gaussian splats efficiently. It uses parallel programming and culling to render hundreds of millions of Gaussians in real time.

Show HN: Hitoku Draft – Context aware local assistant

$5 Launch price · one-time Early supporters lock in $5. Price rises as the app grows. Get Hitoku Draft for $5 Press your hotkey in any text field on your Mac. Voice-to-text and AI editing activate instantly. 録 Drop in audio or video recordings and get clean text — on your Mac. Refine it with your voice. Get Hitoku Draft

AI, Ashby Engineering, and the future

https://a-us.storyblok.com/f/1017310/1325x746/d767887728/ai-vs-bugs.png
Ashby Engineering uses AI to generate code, but emphasizes human judgment and empathy to ensure quality and customer understanding. Engineers focus on high-level decisions, while AI handles mechanical tasks, freeing them to build great products and understand customer needs.

U.S. Army Corps of Engineers Bay Model

https://upload.wikimedia.org/wikipedia/commons/thumb/8/87/USCAE_Bay_Model_-_San_Francisco_Bay_Detail.jpg/330px-USCAE_Bay_Model_-_San_Francisco_Bay_Detail.jpg
The Bay Model is a working hydraulic scale model of the San Francisco Bay and Sacramento-San Joaquin River Delta System, located in Sausalito, California. It was built in 1957 to study proposed dam plans but is now open to the public for educational exhibits about Bay hydrology.

3D-printed book turns its own G-code into raised lettering

https://static.designboom.com/wp-content/uploads/2026/05/darius-ou-manual-3D-printed-book-designboom-600.jpg
Studio Darius Ou and Benson Chong created Manual, a 3D-printed book carrying part of its own machine code. It uses an XY-for-Z printing method to materialize as a fully bound book directly from the machine.