Notes

A running log of what I produce and what I consume.

2026
Recursive Language Models
2025
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought
2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
2025
Zep: A Temporal Knowledge Graph Architecture for Agent Memory
2025
How to Tame Your LLM: Semantic Collapse in Continuous Systems
2025
Memory in the Age of AI Agents
2024
FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision
2024
Mixtral of Experts
2024
ARC Prize 2024: Technical Report
2024
The Llama 3 Herd of Models
2024
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
2024
DeepSeek-V2 Technical Report
2024
DeepSeek-V3 Technical Report
2024
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
2024
OpenHands: An Open Platform for AI Software Developers as Generalist Agents
2024
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
2023
DINOv2: Learning Robust Visual Features without Supervision
2023
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
2023
GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints
2023
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
2023
Self-Consistency Improves Chain of Thought Reasoning in Language Models
2023
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
2023
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
2023
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
2023
Let's Verify Step by Step
2023
Efficient Memory Management for Large Language Model Serving with PagedAttention
2023
Fast Inference from Transformers via Speculative Decoding
2023
Toolformer: Language Models Can Teach Themselves to Use Tools
2023
GPT-4 Technical Report
2023
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
2023
Steering Language Models With Activation Engineering
2022
Training Compute-Optimal Large Language Models
2022
Training language models to follow instructions with human feedback
2022
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
2022
Constitutional AI: Harmlessness from AI Feedback
2022
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
2022
Large Language Models are Zero-Shot Reasoners
2022
ReAct: Synergizing Reasoning and Acting in Language Models
2022
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
2021
Training data-efficient image transformers & distillation through attention
2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
2021
LoRA: Low-Rank Adaptation of Large Language Models
2020
Scaling Laws for Neural Language Models
2020
Language Models are Few-Shot Learners
2020
Learning to summarize from human feedback
2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
2019
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
2017
Attention Is All You Need
2017
Deep Reinforcement Learning from Human Preferences
2017
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm