LLM from scratch, part 28 – training a base model from scratch on an RTX 3090

https://www.gilesthomas.com/images/x-icon.png
The author trained a GPT-2-small-sized base model on their own hardware using the Hugging Face FineWeb-series datasets, reaching performance almost as good as the original in just over 48 hours. They used the Chinchilla heuristic to choose the number of training tokens, which gave a 3.2B-token training run that could be completed in 44 hours using ...
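
The arithmetic behind that summary can be sketched roughly as follows. The Chinchilla heuristic is commonly quoted as about 20 training tokens per parameter; that ratio and the 160M parameter count below are assumptions chosen to reproduce the 3.2B figure, not numbers taken from the post. The throughput helper simply divides the stated 3.2B tokens by the stated 44 hours.

```python
# Rule-of-thumb sketch only. The 20 tokens-per-parameter ratio and the
# example parameter count are assumptions, not figures from the post.
TOKENS_PER_PARAM = 20


def chinchilla_tokens(n_params: int, ratio: int = TOKENS_PER_PARAM) -> int:
    """Approximate compute-optimal token budget for a model of n_params."""
    return n_params * ratio


def tokens_per_second(total_tokens: float, hours: float) -> float:
    """Average throughput needed to process total_tokens in the given hours."""
    return total_tokens / (hours * 3600)


if __name__ == "__main__":
    n_params = 160_000_000  # hypothetical parameter count
    budget = chinchilla_tokens(n_params)
    print(f"token budget: {budget / 1e9:.1f}B")
    print(f"required throughput: {tokens_per_second(3.2e9, 44):,.0f} tokens/s")
```

Under these assumptions, a 44-hour run over 3.2B tokens needs a sustained rate of roughly twenty thousand tokens per second.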

The Joy of Playing Grandia, on Sega Saturn

https://i0.wp.com/www.segasaturnshiro.com/wp-content/uploads/2025/11/GrandiaReview_banner.png?resize=728%2C409&ssl=1
The Saturn is experiencing a renaissance due to dedicated translator teams unlocking Japanese titles for new audiences. Grandia, a text-heavy RPG, is one such title that has been translated for the Saturn, offering a story-driven experience with 3D environments and a complex combat system.

No ARIA is better than bad ARIA

https://www.w3.org/WAI/assets/images/social-sharing-default.jpg
ARIA roles, states, and properties provide accessibility semantics for assistive technologies, ensuring non-visual experiences match visual ones. Incorrect ARIA use can misrepresent visual experiences, causing devastating effects on non-visual experiences.

Show HN: AlgoDrill – Interactive drills to stop forgetting LeetCode patterns

https://algodrill.io/finallogo.png
NeetCode helps students master LeetCode problems through guided steps and fill-in-the-code drills to turn solutions into automatic recall. This approach is used by top students at universities like UC Berkeley and Georgia Tech to land offers at FAANG+ companies.

Epsilon: A WASM virtual machine written in Go

https://opengraph.githubassets.com/ca1f1c8641ffc75e9117717d74c05c2397375d0395529743d4b4c7ac29540ab1/ziggy42/epsilon
This code reads a WASM file, instantiates it, and invokes an exported function. It also demonstrates how to create and use imports with the epsilon WASM runtime.

Icons in Menus Everywhere – Send Help

https://cdn.jim-nielsen.com/blog/2025/context-menu-sheets.png
The author dislikes default icons in menu items, finding them unnecessary noise, and prefers a thoughtful approach to icon usage. They argue that Apple's new approach goes against their own previous human interface guidelines.

The universal weight subspace hypothesis

https://arxiv.org/static/browse/0.3.4/images/arxiv-logo-fb.png
Deep neural networks trained on diverse tasks converge to shared low-dimensional subspaces. This shared structure has implications for model reusability, multi-task learning, and developing more efficient algorithms.

Kroger acknowledges that its bet on robotics went too far

https://d1b6lhn2ymmy1x.cloudfront.net/journalist-headshots/silverstein-sam-circle-150x150.png
Kroger is closing three robotic e-commerce facilities and refocusing on its stores due to disappointing performance. The company will incur $2.6 billion in charges but expects a $400 million boost in e-commerce profitability.

Manual: Spaces

https://cdn-gc.type.today/storage/post/8/830/preview_preview_image_ru-lLth2ELpSRNTy-Q9wZ_8RHHj4w3aErjjLA.jpg
Whitespace is a crucial typographic element that affects text rhythm and organization, and its width can vary depending on the typeface, point size, and line width. Proper use of whitespace, including word spacing, non-breaking spaces, and additional spaces, is essential for creating a clear and visually appealing design.

A deep dive into QEMU: The Tiny Code Generator (TCG), part 1

This blog post details some internals of the QEMU TCG engine, the machinery responsible for executing target instructions on the host. You should have already read Execution loop and Breakpoints handling blog posts to have some pointers. As previously explained, tb_gen_code will generate intermediate representation (IR) code thanks to gen_intermediate_code and then host architecture assembly ...

Jepsen: NATS 2.12.1

https://jepsen.io/analyses/nats-2.12.1/blk-bitflip-loss.png
NATS JetStream promises "at least once" delivery but acknowledged messages may be lost due to file corruption, power failures, or coordinated node failures. The system's default fsync interval of every two minutes can lead to data loss when nodes experience correlated power failure or fail in rapid succession.

Strong earthquake hits northern Japan, tsunami warning issued

https://www3.nhk.or.jp/nhkworld/upld/thumbnails/en/news/20251209_02_1598995_L.png
A 7.6 magnitude earthquake struck off Aomori Prefecture's eastern coast at 11:15 p.m. on Monday, causing strong tremors and a tsunami warning for northern Japan's Pacific coastline.

Torture Techniques from CIA Black Sites Were Used at Alligator Alcatraz

https://www.forever-wars.com/content/images/size/w2000/2025/12/Screenshot-2025-12-08-at-1.58.33---PM-1.png
The CIA used the Confinement Box, a coffin-shaped box, to torture Abu Zubaydah for 20 days in 2002. The technique was later used in Florida's Alligator Alcatraz facility to punish migrants.

Microsoft increases Office 365 and Microsoft 365 license prices

https://i0.wp.com/office365itpros.com/wp-content/uploads/2025/12/Microsoft-365-Prices-July-1-2026.jpg?resize=732%2C831&ssl=1
Microsoft announced price increases for Microsoft 365 monthly licenses, effective July 1, 2026, with increases ranging from zero to $3 per month. The price changes could bring in an extra $10.7 billion, offsetting investments in AI and datacenter capacity.

AMD GPU Debugger

https://thegeeko.me/open-graph/amd-gpu-debugging.png
The author is building a GPU debugger modeled on CPU debuggers. So far they have written a basic shader that runs on the GPU and paused its execution with a trap handler. They plan to integrate the debugger with RADV, using its compiler to run SPIR-V binaries, and have ideas for implementing instruction-level stepping and breakpoints.

Has the cost of building software dropped 90%?

https://martinalderson.com/img/cost_of_shipping@2x.png
The software development industry is on the cusp of a once-in-a-generation shift with agentic coding tools that can transform the way software is built, making it faster and cheaper. This change will create new opportunities for developers who master these tools, but also requires a mindset shift and a willingness to adapt to the rapidly improving capabilities of AI coding tools.

A thousand-year-long composition turns 25 (2024)

https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee832e27-817d-4c48-868f-e781fb6633aa_2048x1878.jpeg
Longplayer has been playing continuously for 25 years since midnight on December 31 1999, and will continue until December 31 2999. It's a unique composition that inspires long-term thinking and connects an international community through time.

Launch HN: Nia (YC S25) – Give better context to coding agents

Horses: AI progress is steady. Human equivalence is sudden

https://andyljones.com/source/horses/horse_efficiency.png
The speaker discussed how AI surpassed human capabilities in chess and job automation, with Claude answering 30,000 questions a month. This rapid progress is comparable to the replacement of horses by mechanical engines.

Let's put Tailscale on a jailbroken Kindle

https://cdn.sanity.io/images/w77i7m8x/production/7bbe7afbd47ed73302e8e97c51311c31406e10d5-1000x750.png?w=2048&q=75&fit=clip&auto=format
A user jailbreaks their Kindle to run unofficial apps and access more e-books. They use Tailscale to connect their Kindle to their network and access files and a command line.

Trials avoid high risk patients and underestimate drug harms

https://www.nber.org/sites/default/files/styles/promo/public/2025-07/MF%20Lecture%202025%20updated.png?itok=ij7zY5fj
Researchers Jason Abaluck, Leila Agha, and Sachin Shah received funding to study oncology trials and their impact on patient outcomes. Their work was supported by grants from NIA, Alfred P. Sloan Foundation, and the Lynde and Harry Bradley Foundation.

IBM to acquire Confluent

https://images.ctfassets.net/8vofjvai1hpv/7GXh7qhoiRcHk8Fjip1amV/c0b47a9ad7aaef26ffe6bf188d9e26d7/Tech-Confluent-General.png
Confluent has agreed to be acquired by IBM in an all-cash deal for $31.00 per share. Confluent will continue to operate as a distinct brand within IBM after the transaction closes.

Paramount launches hostile bid for Warner Bros

https://image.cnbcfm.com/api/v1/image/108238296-17652040561765204053-42884714451-1080pnbcnews.jpg?v=1765204055&w=750&h=422&vtcrop=y
Paramount is launching a hostile bid to buy Warner Bros. Discovery for $30 per share after losing out in a bidding war to Netflix. The bid is backed by equity financing and debt commitments, with Paramount arguing it will be a more competitive option than a Netflix-WBD merger.

The Lost Machine Automats and Self-Service Cafeterias of NYC (2023)

https://www.untappedcities.com/content/images/wp-content/uploads/2022/10/marvelous-mrs-maisel-filming-locations-horn-and-hardart-set-amazon-studios-crown-heights-brooklyn-nyc2-1.jpg
Automats and self-service cafeterias were a staple of New York City's dining scene in the 20th century, offering affordable food and social spaces. Many iconic locations, such as Horn & Hardart and Dubrow's Cafeteria, have closed, but their legacy lives on in New York City culture.

Periodic Spaces

https://ianthehenry.com/posts/periodic-spaces/hero.png
The author explains how to render an infinite number of shapes in real time using signed distance functions and ray marching, which evaluates one shape at a time and uses the distance field to decide how far to advance. The author also discusses domain repetition, a technique that allows for discrete tiling and periodic functions of space, which can be used to create smooth tiling ...
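
A minimal sketch of the two ideas in that summary, not the author's code: a sphere signed distance function, domain repetition that folds space into a repeating cell, and a sphere-tracing loop that advances by the distance the field reports. The function names, the 0.25 radius, the unit period, and the step limits are all illustrative assumptions. The sketch assumes the shape fits inside one cell, so the folded distance is the distance to the nearest copy.

```python
import math


def sphere_sdf(p, radius=0.25):
    """Signed distance from point p to a sphere centered at the origin."""
    x, y, z = p
    return math.sqrt(x * x + y * y + z * z) - radius


def repeat(p, period=1.0):
    """Domain repetition: fold each coordinate into [-period/2, period/2)."""
    half = 0.5 * period
    return tuple((c + half) % period - half for c in p)


def scene_sdf(p):
    """One sphere evaluated in folded space acts as an infinite lattice."""
    return sphere_sdf(repeat(p))


def ray_march(origin, direction, max_steps=128, max_dist=50.0, eps=1e-4):
    """Sphere tracing: step along the ray by the distance the field returns.

    Returns the hit distance along the ray, or None if nothing is hit.
    direction is assumed to be a unit vector.
    """
    t = 0.0
    for _ in range(max_steps):
        p = tuple(o + t * d for o, d in zip(origin, direction))
        dist = scene_sdf(p)
        if dist < eps:
            return t
        t += dist
        if t > max_dist:
            break
    return None


if __name__ == "__main__":
    # Ray aimed straight at the sphere copy centered on (0, 0, 1).
    print(ray_march((0.0, 0.0, 0.5), (0.0, 0.0, 1.0)))
    # Ray running between rows of spheres, which never gets close to one.
    print(ray_march((0.0, 0.5, 0.5), (1.0, 0.0, 0.0)))
```

Only one sphere is ever evaluated per step; the modulo in `repeat` is what makes it appear at every lattice point.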

Hunting for North Korean Fiber Optic Cables

https://nkinternet.com/wp-content/uploads/2025/12/d3e27ea5-25f1-49ed-9392-e38cb466b732.png?w=1024
The author analyzed a PowerPoint presentation by the DPRK on their aviation industry and ADS-B deployment, which led to research on North Korea's fiber optic network. The network likely runs along the east coast, following major roads and railways, with a possible route from Pyongyang to Wonsan to Hamhung.

Cassette tapes are making a comeback?

https://images.theconversation.com/files/706283/original/file-20251204-56-ym2e5n.png?ixlib=rb-4.1.0&q=45&auto=format&w=754&fit=clip
Cassette sales are increasing even though the format is fragile and low-quality, with major artists releasing music on it. This trend is seen as a rediscovery or discovery of cassettes, rather than a full-blown revival, offering a tactile connection and resistance to digital dominance.

AI should only run as fast as we can catch up

The author discusses the problem of reliable engineering with AI: people need to catch up with AI's work as quickly as possible to ensure it is reliable and trustworthy. The problem breaks down into two parts, learning/creation and verification, and the author proposes the concept of Verification Engineering to make verifying AI-performed tasks easier and more effective.

Show HN: Fanfa – Interactive and animated Mermaid diagrams

Paste Mermaid, watch it breathe with cinematic SVG animation.

Microsoft Download Center Archive

https://legacyupdate.net/banner.png
Legacy Update archives Microsoft downloads removed between 2012 and 2025, including Windows, Office, and SQL Server. Users can browse or search the archive, but downloads may have security vulnerabilities and require updates.