The last six months in LLMs, illustrated by pelicans on bicycles

https://static.simonwillison.net/static/2025/ai-worlds-fair/ai-worlds-fair-2025-01.jpeg
The author presented a keynote at the AI Engineer World's Fair in San Francisco, discussing recent advancements in Large Language Models (LLMs). They evaluated 30 models released in the last six months, including Llama 4, GPT-4.5, and Claude 3.7 Sonnet, using a custom benchmark to compare their performance.

Building supercomputers for autocrats probably isn't good for democracy

https://substackcdn.com/image/fetch/w_520,h_272,c_fill,f_auto,q_auto:good,fl_progressive:steep,g_auto/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc14b5829-f26f-4d2c-a624-daa066587912_1858x1326.png
OpenAI partnered with the UAE to build large-scale AI data centers, claiming it promotes democratic values. However, the UAE has a poor human rights record and lacks democratic freedoms.

We’re secretly winning the war on cancer

https://platform.vox.com/wp-content/uploads/sites/2/2025/06/GettyImages-1581229470.jpg?quality=90&strip=all&crop=0.0244140625%2C0%2C99.951171875%2C100&w=2400
Jon Gluck, a cancer survivor, has lived with multiple myeloma for over 20 years, defying initial predictions of an 18-month lifespan. His story highlights the significant decline in cancer deaths in the US, with a third fewer cancer deaths since 1991.

Why Android can't use CDC Ethernet (2023)

https://jordemort.dev/images/android-cdc/connection-settings.jpg
To determine which USB Ethernet adapters are compatible with an Android device, you need to find the kernel configuration for your phone and look for USB_NET configuration variables that match the chipset of the adapter you want to use. The kernel configuration can be found by enabling USB debugging, installing ADB, and running shell commands on the phone to discover the kernel version and ...

Self-hosted x86 back end is now default in debug mode

Zig compiler now uses its own x86 backend by default for x86_64 targets, improving compilation speed and robustness. This change allows for faster compilation of large projects like the Zig compiler itself.

What happens when people don't understand how AI works

https://cdn.theatlantic.com/thumbor/LKXTz2WhmvY44e1AbP9DBkxL-4w=/0x0:2000x1125/960x540/media/img/mt/2025/06/2025_6_2_AI_Illiteracy/original.gif
A British writer Samuel Butler predicted a "mechanical kingdom" in 1863, now a reality with AI. Two books, "Empire of AI" and "The AI Con," suggest the AI industry is a scam, exaggerating AI's capabilities.

A look at Cloudflare's AI-coded OAuth library

The author reviewed CloudFlare's OAuth library generated with Anthropic's Claude LLM, finding some security issues and questionable design choices. The library has some good aspects, but it's not yet ready for use due to its incomplete testing and potential vulnerabilities.

I used AI-powered calorie counting apps, and they were even worse than expected

https://lifehacker.com/imagery/articles/01JWGWP5A9JX4QHHXMWT06PHN1/hero-image.fill.size_1248x702.v1748618646.png
AI-powered calorie counting apps are flawed due to human error in ingredient identification and portion estimation. Traditional methods with food scales remain more reliable for precise tracking.

Ask HN: How to learn CUDA to professional level

To learn CUDA programming, start with NVIDIA's standard courses and practice with simple projects that you can port to CUDA and benchmark against CPU. Focus on learning high-performance computing approaches and massive-parallel programming, which can translate to transferable skills.

Administering immunotherapy in the morning seems to matter. Why?

https://substackcdn.com/image/fetch/w_520,h_272,c_fill,f_auto,q_auto:good,fl_progressive:steep,g_auto/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64b823c6-4d7f-45d7-b655-75fd8fe339a0_2912x1632.png
Research suggests that administering immunotherapy in the morning may improve cancer treatment outcomes due to the body's natural circadian rhythm and immune system preparation. A randomized clinical trial and multiple retrospective studies support this idea, but more research is needed to confirm the findings and understand the underlying mechanisms.
https://danq.me/_q23u/2020/11/dreamweaver-blink-marquee.png
A web developer made a joke about the HTML <blink> and <marquee> tags, which were popular in the 90s but have since fallen out of use. They were used to create flashing and scrolling text, but are now considered outdated and inaccessible.

Louis Rossmann: We've started a foundation to bring back ownership [video]

Omnimax

https://computer.rip/f/omnimax/b.jpg
The sphere is a theater and venue in las vegas, nevada. the fleet planetarium was not the only omnimax theater for long, says robert s. heinz if you look closely, you can see the projectionists at work preparing the screen for the film 'voyage to the outer planets' the end of aggressive support from imax meant that filming costs became impractical

Gaussian integration is cool

https://rohangautam.github.io/img/wgxIX_3WfQ-400.webp
Gaussian quadrature is a numerical integration technique that uses function evaluations at special points called nodes to estimate a definite integral. It can estimate a polynomial of order 2n-1 with n nodes and another set of n weights.

Riding high in Germany on the world's oldest suspended railway

https://i.guim.co.uk/img/media/b3b99c72fd2cce4c7f0b42b0ddb9daa54afa906e/504_0_5040_4032/master/5040.jpg?width=445&dpr=1&s=none&crop=none
The Wuppertal Schwebebahn in Germany is a 125-year-old suspended railway that remains an incredible feat of engineering. It offers a unique commuter experience with sleek modern carriages gliding above the river Wupper.

Demystifying Debuggers

https://substackcdn.com/image/fetch/w_520,h_272,c_fill,f_auto,q_auto:good,fl_progressive:steep,g_auto/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1960a5b1-bafa-433f-8529-8051ae2ae8c6_1920x1080.png
The author discusses the importance of debuggers in programming, highlighting their utility in analyzing working code, verifying correctness, and providing educational value. They plan to write a series explaining debugger architecture.

Poison everywhere: No output from your MCP server is safe

https://www.cyberark.com/wp-content/uploads/2025/05/poison-everywhere-blog.png
The Model Context Protocol (MCP) is an open standard that allows large language models to interact with external tools, but it has a vulnerability known as Tool Poisoning Attack (TPA) that can be exploited by manipulating the tool schema or output. To mitigate this risk, it is recommended to implement static detection, strict enforcement, and contextual integrity checks for LLMs to ensure ...

Maintaining an Android app in Google Play Store is a lot of work

https://ashishb.net/img/android-logo.png
The user, an Android developer, shares their experience of maintaining a hobby app, MusicSync, and explains why the recent 47% decline in Google Play Store apps is not surprising due to the complexities and breaking changes in Android development. Maintaining an Android app requires more effort than server-side development, and hobby developers should consider the ongoing maintenance costs ...

Why not use DNS over HTTPS (DoH)?

DoH is not about protecting DNS queries from all peepers, but rather directing them to a single provider like Cloudflare. This can be problematic as it allows a single entity to collect all DNS queries.

FAA to eliminate floppy disks used in air traffic control systems

https://img.youtube.com/vi/0xhzApZS4q8/maxresdefault.jpg
The US FAA plans to upgrade its outdated air traffic control system to 21st century technology, replacing paper strips and floppy disks with modern systems. The project aims to complete the upgrade within four years, but industry experts consider this timeline unrealistic.

Air-dried vs. Kiln-dried Wood

https://substackcdn.com/image/fetch/w_520,h_272,c_fill,f_auto,q_auto:good,fl_progressive:steep,g_auto/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe4b59cc-778c-475c-8aeb-f853b7f1ea01_3089x1863.jpeg
Wood-drying processes involve air-drying and kiln-drying, with kiln-drying controlling temperature and humidity for faster drying, while air-drying is more variable and dependent on weather. Kiln-drying has evolved with technology, including digital controls and AI, to produce high-quality wood with low moisture content, while air-drying can produce relaxed and high-quality wood with proper control.

Knowledge Management in the Age of AI

The author used to be an Emacs enthusiast but stopped due to its complexity. They're now using Obsidian for personal knowledge management with the PARA method.

The Illusion of Thinking: Strengths and Limitations of Reasoning Models

https://mlr.cdn-apple.com/media/Home_1200x630_48225d82e9.png
Recent Large Reasoning Models (LRMs) show improved performance on reasoning benchmarks but their capabilities and limitations are insufficiently understood. LRMs face accuracy collapse beyond certain complexities and exhibit counter-intuitive scaling limits.

Show HN: Let’s Bend – Open-Source Harmonica Bending Trainer

https://letsbend.de/images/logo.png
The Let's Bend app helps harmonica beginners master bending notes through visual practice. It's available on desktop and Android, free of charge with no ads.

The wire that transforms much of Manhattan into one big, symbolic home (2017)

https://img.atlasobscura.com/7HrQjduxlsiFBz6DnG4bNssRATGfqugMN5BDw5WJ3d8/rs:fill:12000:12000/q:81/sm:1/scp:1/ar:1/aHR0cHM6Ly9hdGxh/cy1kZXYuczMuYW1h/em9uYXdzLmNvbS91/cGxvYWRzL2Fzc2V0/cy9hYzkxZTNhZTAz/N2EwMTM5NjNfRFND/XzA3NzYuanBn.jpg
Rabbi Moshe Tauber checks Manhattan's eruv wire daily to ensure it's intact before Shabbat, allowing observant Jews to carry objects between private and public spaces. The wire, a symbolic boundary, is maintained by the Jewish community with support from wealthy philanthropists to enable the community to observe their faith.

Reverse engineering Claude Code

https://kirshatrov.com/assets/layout-images/kirs.jpg
Claude Code is slower and more expensive due to its security policy and multiple LLM calls for each task. It provides a generic and secure UX, but can be improved with more efficient tool usage and pricing.

Binfmtc – binfmt_misc C scripting interface

https://www.netfort.gr.jp/~dancer/software/img/20050506-binfmtc1.png
User loves C programming and wants to use it for everyday scripting. A tool called binfmtc allows compiling and executing C scripts directly.

Re: My AI skeptic friends are all nuts

The author is concerned that relying too heavily on AI tools will lead to a loss of critical thinking skills and a decrease in the quality of code. They argue that humans should not outsource their problem-solving abilities completely.
1% of users clicked a feature, with 89% clicking the first position. Carousel testing failed, with users often missing or ignoring content.

Focus and Context and LLMs

The author criticizes agentic coding hype, stating that LLMs can only produce impressive results under intense algorithmic supervision. They highlight the difficulty of curating context for LLMs, making them only truly impressive in the hands of exceptional software engineers.

An innovative superfamily of fonts for code (2023)

https://monaspace.githubnext.com/opengraph-image.png?8f0e79eb8fe60cfc
Proportional typefaces have unique glyph widths, while monospaced faces have uniform widths, altering letter designs to fit. Texture healing replaces compromised pairs with more legible variants in monospaced fonts.

A Knockout Blow for LLMs?

https://substackcdn.com/image/fetch/w_520,h_272,c_fill,f_auto,q_auto:good,fl_progressive:steep,g_auto/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F16e2058f-a9b4-4404-8295-e0c09e005c38_1216x1502.png
Gary Marcus and Subbarao Kambhampati argue that large language models (LLMs) fail to reason reliably outside their training distribution, echoing Marcus's 1998 paper on neural networks and Kambhampati's critique of "chain of thought" models. The Apple paper shows that even the latest LLMs struggle to solve classic problems like the Tower of Hanoi, highlighting the need for well-specified ...

YouTuber claims to have received an offer to buy the Commodore brand

Sean Donohue's My Retro Computer Ltd acquired Commodore 64x rights, initially selling it as 'My 64' due to high licence fees. The company now offers it as 'Commodore 64x' after agreeing with Commodore Corporation B.V.

Fray: A Controlled Concurrency Testing Framework for the JVM

https://raw.githubusercontent.com/cmu-pasta/fray/main/assets/logo-cropped.png
Fray is a Java concurrency testing tool that helps find and debug race conditions using state-of-the-art techniques. It can be integrated into existing testing frameworks like JUnit 5 with the @ConcurrencyTest annotation.

Installing Microsoft Windows 98 in DOSBox-X

https://dosbox-x.com/wiki/images/Windows:Windows_98_SPLASH.png
To install Windows 98 in DOSBox-X, start by creating a harddisk image and configuring the win98.conf file, then mount the Windows 98 CD-ROM and follow the installation process. After installation, you can enable networking, adjust video and sound settings, and install additional software to enhance the Windows 98 experience in DOSBox-X.

Launching the BeOS on Hitachi Flora Prius Systems (1999)

To boot BeOS on a Hitachi FLORA Prius 330J, you can use a floppy disk, install a BeOS Launcher for Windows, or install Bootman, a boot manager that allows you to choose between operating systems. Bootman is included on the BeOS backup CD-ROM.

I got a remote job for a EU company, I'd find it hard to go back to a US-based

https://i.insider.com/6842cead9b2a601d01b27211?width=700
Meghan Gezo, a 34-year-old from Michigan, left her remote job in the US to work for Storyblok, a fully remote Austrian company. She found better work-life balance and benefits, including 16 weeks of paid maternity leave and a 38.5-hour workweek.

A Thousand Tiny Optimisations

https://leejo.github.io/images/2025/zelda/link-to-the-past.png
The author reminisces about playing A Link to the Past and discovers a randomizer that shuffles game items, creating infinite variations. The randomizer has evolved into a complex challenge with a large community and speedrunning aspect.

BYD's Five-Minute Charging Puts China in the Lead for EVs

https://spectrum.ieee.org/media-library/a-man-in-a-suit-speaking-on-stage-he-is-standing-between-two-passenger-cars-each-placed-on-the-far-right-and-far-left-side-of.jpg?id=60463254&width=1200&height=750
BYD's 1,000-kilowatt fast chargers can add 400 kilometers of range in five minutes, making EV charging as quick as gasoline fill-ups. This technology, demonstrated in China, could eliminate the biggest consumer gripe over EVs: long charging times.

Ask HN: In 15 years, what will a gas station visit look like?

Gas stations will change gradually with more EV chargers and possibly fewer staff due to automation. By 2040, they may resemble rest stops with fuel and charging, and some will remain for ICE engines.

Apple Is on Defense at WWDC

https://platform.theverge.com/wp-content/uploads/sites/2/2025/06/cook_wwdc_24.jpg?quality=90&strip=all&crop=0.17045454545455%2C0%2C99.659090909091%2C100&w=2400
Allison Johnson criticizes Apple's recent missteps, including slow Vision Pro sales and stalled Apple Intelligence features. She wants Apple to show humility at WWDC and acknowledge its mistakes.

The Diary of Samuel Pepys

https://www.historytoday.com/sites/default/files/2025-05/samuel_pepys_diary_censored_history_today.jpg
To continue reading this article you need to purchase a subscription, available from only £5. If you have already purchased access, or are a print & archive subscriber, please ensure you are logged in.

Ask HN: Does anyone know of a general news site akin to Hacker News?

You're looking for a platform that offers curated, non-tech news with commenting functionality, replacing Reddit, TikTok, and Google News. You want a community-driven site with thoughtful discussions, free from distractions and mainstream noise.

Nginx Restic Back End

https://www.grepular.com/images/amazon/mastering_nginx.jpg
Mike Cardwell used Restic for backups and created an Nginx configuration to act as a Rest Server, allowing append-only backups and administrative tasks. He used Nginx's DAV and LUA modules to make it compatible with Restic's REST API.

The printer that transcends dimensions and corrupts reality

https://ghuntley.com/content/images/size/w960/2025/06/An-epic--grand-scene-depicted-in-a-symbolic-traditional-tattoo-art-print-of-an-intergalactic-printer-with-weapons--traveling-through-space-and-opening-Stargate-wormholes--referencing-PC-load-letter.-Vibrant-colors--retro-flair--complex-orna.jpg
The user shares a story about a printer that arrived with eldritch symbols and had reality-bending features, causing chaos in the IT department, but they eventually learned to use its quirks productively. The user also shares their experience learning to work with a Large Language Model (LLM) by giving it free rein to propose solutions, leading to the development of a formally verified batch ...

Rohde and Schwarz AMIQ Modulation Generator Teardown

https://tomverbeure.github.io/assets/amiq/amiq_frontside.jpg
The user acquired a Rohde & Schwarz AMIQ, an I/Q modulation generator, for $45 in an auction and spent significant effort to restore it to working state. The device has a complex analog circuit design, including a programmable DAC clock, variable gain amplifier, and diagnostic signal generation, which the user plans to explore in future blog posts.

Tool to identify poisonous books developed by University of St Andrews

https://i.guim.co.uk/img/media/d16ecda789232b02a57b28ed810913332d0fa803/0_11_623_779/master/623.jpg?width=445&dpr=1&s=none&crop=none
A new device developed by the University of St Andrews can quickly and cheaply detect toxic pigment in old green books. The handheld tool shines different colors of light on a book to identify the unique reflectance pattern of emerald green pigment.

The Hashtable Packing Problem (2020)

https://backscattering.de/chess/hashtable-packing/sketch.png
The Hashtable Packing Problem is an NP-complete problem that involves packing multiple hashtables in memory without overlapping occupied buckets. It is strongly NP-complete, meaning no efficient algorithm exists to find optimal solutions.

Researchers discover evidence in the mystery of America's 'Lost Colony'

https://a57.foxnews.com/static.foxnews.com/foxnews.com/content/uploads/2024/09/340/340/Andrea-Vacchiano-057.jpg?ve=1&tl=1
Researchers Mark Horton and Scott Dawson found evidence that Roanoke Colony settlers assimilated into Native American society on Hatteras Island. They discovered hammerscale, iron-working evidence, and other English artifacts in a Native American trash heap, suggesting the colonists survived and integrated.

OpenBSD IO Benchmarking: How Many Jobs Are Worth It?

https://rsadowski.de/posts/2025/fio_simple_benckmarking/benchmark-read.png
OpenBSD 7.7 outperforms Linux 6.12 in disk I/O benchmarks with optimal job count between 6-8. More jobs do not always result in better performance due to trade-offs in latency and usability.

The race to find GPS alternatives

https://wp.technologyreview.com/wp-content/uploads/2025/06/Aerospacelab_IOV-ReadyToLaunch_1.jpeg?w=1366
A new satellite called Pulsar-0 is launching to test next-generation satnav technology for a constellation called Pulsar, which aims to provide more accurate and secure positioning signals than GPS. The Pulsar system will have 258 satellites in low Earth orbit, offering stronger signals and better resistance to jamming and spoofing.

Acetylated cellulose suppresses mass through commensals consuming carbohydrates

Please confirm you are a human by completing the captcha challenge below.

Show HN: Update to my meta glasses API "Hey Meta send a message to ChatGPT"

https://raw.githubusercontent.com/dcrebbin/meta-vision-api/main/assets/mai-promotional.png
Browser extension adds custom AI bots to Meta Messenger for use with Meta Rayban Smart Glasses or standalone app. It sends chat logs to providers like ChatGPT and responds with audio clips using OpenAI.

BYD's Five-Minute Charging Puts China in the Lead for EVs

https://spectrum.ieee.org/media-library/a-man-in-a-suit-speaking-on-stage-he-is-standing-between-two-passenger-cars-each-placed-on-the-far-right-and-far-left-side-of.jpg?id=60463254&width=1200&height=750
BYD's 1,000-kilowatt fast chargers can add 400 kilometers of range in five minutes, making EV charging as quick as gasoline fill-ups. This technology, demonstrated in China, could eliminate the biggest consumer gripe over EVs: long charging times.

Characterizing my first attempt at copper-only passives

The user is trying to measure small RF passives, specifically capacitors and an inductor, using a custom board and various test equipment, including a Siglent SVA1032X VNA and Scikit-RF tools. After revisiting the experiment with new calibration techniques and understanding of S-parameters, the user was able to obtain more accurate measurements and plots that resemble a capacitor and ...