- Faster LLMs: Improving RWKV with Parallel Cumulative Sums
- Simple Fast Attention: Causal Implementation Experiments
- Negative Gearing and the CGT Discount: A Modern Portfolio Theory Analysis
- Retention LLMs: Analysing Algorithms and Alternative Implementations
- Digging into Doherty: Implications of Initialization
Modelling 2
- How the Omicron Wave will be Different Dec 22, 2021
- Digging into Doherty: Implications of Initialization Aug 25, 2021
