The user attempted to post-train an open-source model, Kimi-K2-Thinking, but encountered several issues, including a slow compression step and out-of-memory errors, which were eventually resolved by enabling CUDA virtual memory and modifying the model's architecture. The user ultimately successfully trained the model, but found the process to be time-consuming and expensive, leading them to ...