LLM from scratch, part 28 – training a base model from scratch on an RTX 3090

https://www.gilesthomas.com/images/x-icon.png
The user trained a GPT-2-small-sized base model on their own hardware using the Hugging Face FineWeb-series datasets and achieved performance almost as good as the original's in just over 48 hours. They used the Chinchilla heuristic to determine the optimal number of tokens to train for, which resulted in a 3.2B token training run that could be completed in 44 hours using ...
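
As a rough sanity check on the figures in this summary, the arithmetic can be sketched in a few lines of Python. The 20-tokens-per-parameter ratio is the commonly quoted Chinchilla rule of thumb and is an assumption here, not a number taken from the post; the function names are illustrative only.

```python
# Back-of-the-envelope check of the training budget quoted in the summary.
# TOKENS and HOURS come from the summary; TOKENS_PER_PARAM is an assumed
# rule-of-thumb value (the usual Chinchilla ratio), not a figure from the post.
TOKENS = 3.2e9          # training tokens quoted in the summary
HOURS = 44              # run time quoted in the summary
TOKENS_PER_PARAM = 20   # assumption: roughly 20 tokens per parameter


def implied_params(tokens: float, tokens_per_param: float = TOKENS_PER_PARAM) -> float:
    """Parameter count implied by a token budget under the assumed ratio."""
    return tokens / tokens_per_param


def tokens_per_second(tokens: float, hours: float) -> float:
    """Average throughput needed to process `tokens` in `hours`."""
    return tokens / (hours * 3600)


if __name__ == "__main__":
    print(f"implied parameters: {implied_params(TOKENS) / 1e6:.0f}M")
    print(f"required throughput: {tokens_per_second(TOKENS, HOURS):,.0f} tokens/s")
```

Under that assumed ratio, a 3.2B token budget corresponds to a model of about 160M parameters, and finishing in 44 hours requires sustaining roughly 20,000 tokens per second.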

The Joy of Playing Grandia, on Sega Saturn

https://i0.wp.com/www.segasaturnshiro.com/wp-content/uploads/2025/11/GrandiaReview_banner.png?resize=728%2C409&ssl=1
The Saturn is experiencing a renaissance due to dedicated translator teams unlocking Japanese titles for new audiences. Grandia, a text-heavy RPG, is one such title that has been translated for the Saturn, offering a story-driven experience with 3D environments and a complex combat system.

Transformers know more than they can tell: Learning the Collatz sequence

Transformers learn the long Collatz step in a sequence of discrete steps, where specific classes of input are learned and predicted with high accuracy. The learning pattern is independent of the base and corresponds to a deep mathematical property of the Collatz sequence.

Why frozen test fixtures are a problem on large projects and how to avoid them

https://radanskoric.com/assets/img/posts/covers/frozen-test-fixtures.png
Fixtures can be problematic in large test suites due to their reusability, which can lead to frozen fixtures. This can be minimized by writing tests that focus on specific properties of the code and avoid making assumptions about fixtures.

Constructing the World's First JPEG XL MD5 Hash Quine

https://stackchk.fail/blog/shark_hashquine.jxl.png
A Hash Quine is a program that prints its own cryptographic hash, and the user created a JPEG XL image that displays its own MD5 hash using MD5 collision blocks and a prediction tree. The image can be modified to encode any 128-bit pattern by swapping the MD5 collision blocks.

Icons in Menus Everywhere – Send Help

https://cdn.jim-nielsen.com/blog/2025/context-menu-sheets.png
The author dislikes default icons in menu items, finding them unnecessary noise, and prefers a thoughtful approach to icon usage. They argue that Apple's new approach goes against their own previous human interface guidelines.

ZX Spectrum Next on the Internet: Xberry Pi ESP01 and Pi Zero Upgrades

https://retrogamecoders.com/wp-content/uploads/2025/12/ZX-Spectrum-Next-on-the-Internet.png
The user upgraded their Xberry Pi ZX Spectrum Next with a Pi Zero accelerator and an ESP8266 Wi-Fi module, but encountered issues with the ESP firmware update. They found that not updating the ESP-01 firmware was the key to getting it working.

Show HN: AlgoDrill – Interactive drills to stop forgetting LeetCode patterns

https://algodrill.io/finallogo.png
AlgoDrill helps students master LeetCode problems through guided steps and fill-in-the-code drills to turn solutions into automatic recall. This approach is used by top students at universities like UC Berkeley and Georgia Tech to land offers at FAANG+ companies.

Rahm Emanuel says U.S. should follow Australia's youth social media ban

A deep dive into QEMU: The Tiny Code Generator (TCG), part 1 (2021)

QEMU's TCG engine executes target instructions on the host by first translating them into intermediate representation (IR) code. Frontend operations translate target instructions to IR, through functions such as translate_insn and gen_intermediate_code, and backend operations turn that IR into host code.

Epsilon: A WASM virtual machine written in Go

https://opengraph.githubassets.com/ca1f1c8641ffc75e9117717d74c05c2397375d0395529743d4b4c7ac29540ab1/ziggy42/epsilon
This code reads a WASM file, instantiates it, and invokes an exported function. It also demonstrates how to create and use imports with the epsilon WASM runtime.

The universal weight subspace hypothesis

https://arxiv.org/static/browse/0.3.4/images/arxiv-logo-fb.png
Deep neural networks trained on diverse tasks converge to shared low-dimensional subspaces. This shared structure has implications for model reusability, multi-task learning, and developing more efficient algorithms.

Kroger acknowledges that its bet on robotics went too far

https://d1b6lhn2ymmy1x.cloudfront.net/journalist-headshots/silverstein-sam-circle-150x150.png
Kroger is closing three robotic e-commerce facilities and refocusing on its stores due to disappointing performance. The company will incur $2.6 billion in charges but expects a $400 million boost in e-commerce profitability.

Brent's Encapsulated C Programming Rules (2020)

The user provides C programming rules focusing on encapsulation, memory management, and code readability, emphasizing the importance of explicit memory ownership and avoiding void pointers. They also discuss best practices for string handling, testing, and code organization, recommending the use of standard library functions and avoiding unnecessary complexity.

Show HN: I built a system for active note-taking in regular meetings like 1-1s

Manual: Spaces

https://cdn-gc.type.today/storage/post/8/830/preview_preview_image_ru-lLth2ELpSRNTy-Q9wZ_8RHHj4w3aErjjLA.jpg
Whitespace is a crucial typographic element that affects text rhythm and organization, and its width can vary depending on the typeface, point size, and line width. Proper use of whitespace, including word spacing, non-breaking spaces, and additional spaces, is essential for creating a clear and visually appealing design.

Jepsen: NATS 2.12.1

https://jepsen.io/analyses/nats-2.12.1/blk-bitflip-loss.png
NATS JetStream promises "at least once" delivery but acknowledged messages may be lost due to file corruption, power failures, or coordinated node failures. The system's default fsync interval of every two minutes can lead to data loss when nodes experience correlated power failure or fail in rapid succession.

Strong earthquake hits northern Japan, tsunami warning issued

https://www3.nhk.or.jp/nhkworld/upld/thumbnails/en/news/20251209_02_1598995_L.png
A 7.6 magnitude earthquake struck off Aomori Prefecture's eastern coast at 11:15 p.m. on Monday, causing strong tremors and a tsunami warning for northern Japan's Pacific coastline.

Mazda suitcase car, a portable three-wheeled vehicle that fits in the luggage

https://static.designboom.com/wp-content/uploads/2025/12/mazda-suitcase-car-portable-three-wheeled-vehicle-designboom-500.jpg
Mazda engineers built a portable three-wheeled vehicle in the early 1990s to move around airports faster. The suitcase car weighed 32 kilos and reached 30 km/h, but never made it to production.

Microsoft increases Office 365 and Microsoft 365 license prices

https://i0.wp.com/office365itpros.com/wp-content/uploads/2025/12/Microsoft-365-Prices-July-1-2026.jpg?resize=732%2C831&ssl=1
Microsoft announced price increases for Microsoft 365 monthly licenses, effective July 1, 2026, with increases ranging from zero to $3 per month. The price changes could bring in an extra $10.7 billion, offsetting investments in AI and datacenter capacity.

Has the cost of building software dropped 90%?

https://martinalderson.com/img/cost_of_shipping@2x.png
The software development industry is on the cusp of a once-in-a-generation shift with agentic coding tools that can transform the way software is built, making it faster and cheaper. This change will create new opportunities for developers who master these tools, but also requires a mindset shift and a willingness to adapt to the rapidly improving capabilities of AI coding tools.

No ARIA is better than bad ARIA

https://www.w3.org/WAI/assets/images/social-sharing-default.jpg
ARIA roles, states, and properties provide accessibility semantics for assistive technologies, ensuring non-visual experiences match visual ones. Incorrect ARIA use can misrepresent visual experiences, causing devastating effects on non-visual experiences.

Let's put Tailscale on a jailbroken Kindle

https://cdn.sanity.io/images/w77i7m8x/production/7bbe7afbd47ed73302e8e97c51311c31406e10d5-1000x750.png?w=2048&q=75&fit=clip&auto=format
A user jailbreaks their Kindle to run unofficial apps and access more e-books. They use Tailscale to connect their Kindle to their network and access files and a command line.

Launch HN: Nia (YC S25) – Give better context to coding agents

AMD GPU Debugger

https://thegeeko.me/open-graph/amd-gpu-debugging.png
The user is trying to create a GPU debugger similar to CPU debuggers, and they have made progress by creating a basic shader that runs on the GPU and pausing its execution using a trap handler. They plan to integrate their debugger with RADV and use its compiler to run SPIR-V binaries, and they have ideas for implementing features like instruction-level stepping and breakpoints.

Trials avoid high risk patients and underestimate drug harms

https://www.nber.org/sites/default/files/styles/promo/public/2025-07/MF%20Lecture%202025%20updated.png?itok=ij7zY5fj
Researchers Jason Abaluck, Leila Agha, and Sachin Shah received funding to study oncology trials and their impact on patient outcomes. Their work was supported by grants from NIA, Alfred P. Sloan Foundation, and the Lynde and Harry Bradley Foundation.

IBM to acquire Confluent

https://images.ctfassets.net/8vofjvai1hpv/7GXh7qhoiRcHk8Fjip1amV/c0b47a9ad7aaef26ffe6bf188d9e26d7/Tech-Confluent-General.png
Confluent has agreed to be acquired by IBM in an all-cash deal for $31.00 per share. Confluent will continue to operate as a distinct brand within IBM after the transaction closes.

Horses: AI progress is steady. Human equivalence is sudden

https://andyljones.com/source/horses/horse_efficiency.png
The speaker discussed how AI surpassed human capabilities in chess and job automation, with Claude answering 30,000 questions a month. This rapid progress is comparable to the replacement of horses by mechanical engines.

30 Year Anniversary of WarCraft II: Tides of Darkness

https://www.jorsys.org/archive/december_2025-files/war2tod.jpg
It has now been 30 years since WarCraft II: Tides of Darkness was released. After the great response to WarCraft: Orcs and Humans, released in November 1994, Blizzard began working on WarCraft II: Tides of Darkness. Development started in the first months of 1995, and the game was released in North America and Australia on December 9, 1995. While WarCraft: Orcs and Humans had laid the ...

Hunting for North Korean Fiber Optic Cables

https://nkinternet.com/wp-content/uploads/2025/12/d3e27ea5-25f1-49ed-9392-e38cb466b732.png?w=1024
The author analyzed a PowerPoint presentation by the DPRK on their aviation industry and ADS-B deployment, which led to research on North Korea's fiber optic network. The network likely runs along the east coast, following major roads and railways, with a possible route from Pyongyang to Wonsan to Hamhung.