Writing
Longer-form notes on research, scaling, video, 3D, and whatever else feels worth writing down.
An essay on why scalable data ecosystems reshape research directions, why 3D feels structurally hard to scale, and why video currently looks like the more natural pre-training frontier.
A longer reflection on error accumulation in causal world models, why LLMs seem far less affected by it, and why rollout length, teacher quality, and base-model scale are the real bottlenecks.
A collection of views on why learnable sparse attention is still underrated for video, what context should really mean in world models, and why data and unified training may matter more than new memory algorithms.