Ring Attention Explained
Apr 2024
How do state-of-the-art LLMs like Gemini 1.5 and Claude 3 scale to long context windows of a million tokens or more? Well, Ring Attention offers a way to split the attention computation across GPUs arranged in a ring while overlapping the key-value communication with computation, effectively enabling zero-overhead scaling of context length with the number of devices.
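To make the ring concrete, here is a minimal single-process sketch in plain NumPy (not the paper's JAX implementation, and the function name `ring_attention_simulated` is made up for illustration). Each simulated "device" owns one query block and starts with one key/value block; the KV blocks rotate around the ring while each device accumulates its output with an online softmax, so after one full rotation every query has attended to every key.

```python
import numpy as np

def ring_attention_simulated(q_blocks, k_blocks, v_blocks):
    """Simulate Ring Attention on one machine (illustrative sketch)."""
    n_dev = len(q_blocks)
    d = q_blocks[0].shape[-1]

    # Per-device running state: unnormalised output, softmax denominator,
    # and running max for numerically stable online softmax accumulation.
    out = [np.zeros_like(q) for q in q_blocks]
    denom = [np.zeros(q.shape[0]) for q in q_blocks]
    running_max = [np.full(q.shape[0], -np.inf) for q in q_blocks]

    k_cur, v_cur = list(k_blocks), list(v_blocks)
    for _ in range(n_dev):
        for i in range(n_dev):
            # Blockwise attention between device i's queries and the
            # KV block it currently holds.
            scores = q_blocks[i] @ k_cur[i].T / np.sqrt(d)
            new_max = np.maximum(running_max[i], scores.max(axis=-1))
            correction = np.exp(running_max[i] - new_max)
            p = np.exp(scores - new_max[:, None])
            out[i] = out[i] * correction[:, None] + p @ v_cur[i]
            denom[i] = denom[i] * correction + p.sum(axis=-1)
            running_max[i] = new_max
        # "Send" each KV block to the next device in the ring. On real
        # hardware this transfer overlaps with the computation above,
        # which is what hides the communication cost.
        k_cur = k_cur[-1:] + k_cur[:-1]
        v_cur = v_cur[-1:] + v_cur[:-1]

    return [o / dnm[:, None] for o, dnm in zip(out, denom)]


# Quick check: concatenating the block outputs should match full attention.
rng = np.random.default_rng(0)
q = [rng.normal(size=(4, 8)) for _ in range(3)]
k = [rng.normal(size=(4, 8)) for _ in range(3)]
v = [rng.normal(size=(4, 8)) for _ in range(3)]
ring_out = np.concatenate(ring_attention_simulated(q, k, v))

scores = np.concatenate(q) @ np.concatenate(k).T / np.sqrt(8)
weights = np.exp(scores - scores.max(-1, keepdims=True))
full_out = (weights / weights.sum(-1, keepdims=True)) @ np.concatenate(v)
assert np.allclose(ring_out, full_out)
```

The sketch runs everything sequentially, so it only shows the math; the point of the real algorithm is that the KV hand-off and the block computation happen at the same time on separate devices.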