Notes
- 2026
Recursive Language Models
- 2025
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought
- 2025
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
- 2025
Zep: A Temporal Knowledge Graph Architecture for Agent Memory
- 2025
How to Tame Your LLM: Semantic Collapse in Continuous Systems
- 2025
Memory in the Age of AI Agents
- 2024
FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision
- 2024
Mixtral of Experts
- 2024
ARC Prize 2024: Technical Report
- 2024
The Llama 3 Herd of Models
- 2024
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
- 2024
DeepSeek-V2 Technical Report
- 2024
DeepSeek-V3 Technical Report
- 2024
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
- 2024
OpenHands: An Open Platform for AI Software Developers as Generalist Agents
- 2024
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
- 2023
DINOv2: Learning Robust Visual Features without Supervision
- 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
- 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
- 2023
GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints
- 2023
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
- 2023
Self-Consistency Improves Chain of Thought Reasoning in Language Models
- 2023
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
- 2023
Graph of Thoughts: Solving Elaborate Problems with Large Language Models
- 2023
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines
- 2023
Let's Verify Step by Step
- 2023
Efficient Memory Management for Large Language Model Serving with PagedAttention
- 2023
Fast Inference from Transformers via Speculative Decoding
- 2023
Toolformer: Language Models Can Teach Themselves to Use Tools
- 2023
GPT-4 Technical Report
- 2023
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
- 2023
Steering Language Models With Activation Engineering
- 2022
Training Compute-Optimal Large Language Models
- 2022
Training language models to follow instructions with human feedback
- 2022
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
- 2022
Constitutional AI: Harmlessness from AI Feedback
- 2022
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
- 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
- 2022
Large Language Models are Zero-Shot Reasoners
- 2022
ReAct: Synergizing Reasoning and Acting in Language Models
- 2022
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
- 2021
Training data-efficient image transformers & distillation through attention
- 2021
Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
- 2021
LoRA: Low-Rank Adaptation of Large Language Models
- 2020
Scaling Laws for Neural Language Models
- 2020
Language Models are Few-Shot Learners
- 2020
Learning to summarize from human feedback
- 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
- 2019
Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
- 2017
Attention Is All You Need
- 2017
Deep Reinforcement Learning from Human Preferences
- 2017
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm