The purpose of this curriculum is to help new Elicit employees build the machine learning background they need, with a focus on language models. I’ve tried to strike a balance between papers that are relevant for deploying ML in production and techniques that matter for longer-term scalability.
If you don’t work at Elicit yet, we’re hiring ML and software engineers.
Recommended reading order:
- Read “Tier 1” for all topics
- Read “Tier 2” for all topics
- Etc.
✨ marks items added after 2024/4/1
Fundamentals
Tier 1
- A short introduction to machine learning
- But what is a neural network?
- Gradient descent, how neural networks learn
Tier 2
- ✨ An intuitive understanding of backpropagation
- What is backpropagation really doing?
- An introduction to deep reinforcement learning
Tier 3
- The spelled-out intro to neural networks and backpropagation: building micrograd
- Backpropagation calculus
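If you want something concrete to run alongside the material above, here is a minimal sketch of the gradient-descent loop those resources explain, fitting a single-parameter model to made-up data (the data, learning rate, and step count are all illustrative):

```python
# Fit y = w * x to toy data by gradient descent on mean squared error.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]  # generated from y = 2x

w = 0.0    # single parameter, initialized at zero
lr = 0.01  # learning rate

for step in range(200):
    # Forward pass: predictions and mean squared error loss.
    preds = [w * x for x in xs]
    loss = sum((p - y) ** 2 for p, y in zip(preds, ys)) / len(xs)

    # Backward pass: dloss/dw, derived by hand for this one-parameter model.
    grad = sum(2 * (p - y) * x for p, y, x in zip(preds, ys, xs)) / len(xs)

    # Gradient descent update: step against the gradient.
    w -= lr * grad

print(round(w, 3))  # converges toward 2.0
```

Backpropagation (and micrograd) is the machinery for computing that `grad` automatically when the model has millions of parameters instead of one.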
Transformers
Tier 1
- ✨ But what is a GPT? Visual intro to transformers
- ✨ Attention in transformers, visually explained
- ✨ Attention? Attention!
- The Illustrated Transformer
- The Illustrated GPT-2 (Visualizing Transformer Language Models)
Tier 2
- ✨ Let's build the GPT Tokenizer
- ✨ Neural Machine Translation by Jointly Learning to Align and Translate
- The Annotated Transformer
- Attention Is All You Need
Tier 3
- A Practical Survey on Faster and Lighter Transformers
- TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second
- Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets
- A Mathematical Framework for Transformer Circuits
Tier 4+
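As a hands-on companion to the attention explanations above, here is a minimal NumPy sketch of scaled dot-product attention, softmax(QKᵀ/√d_k)V, with a single head, no masking, and no learned projections (shapes and values are illustrative):

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def scaled_dot_product_attention(Q, K, V):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V, as in 'Attention Is All You Need'."""
    d_k = Q.shape[-1]
    scores = Q @ K.swapaxes(-2, -1) / np.sqrt(d_k)  # query-key similarity scores
    weights = softmax(scores, axis=-1)              # each query's weights over keys sum to 1
    return weights @ V                              # weighted average of value vectors

# Toy example: 3 tokens, model dimension 4.
rng = np.random.default_rng(0)
Q = rng.normal(size=(3, 4))
K = rng.normal(size=(3, 4))
V = rng.normal(size=(3, 4))
print(scaled_dot_product_attention(Q, K, V).shape)  # (3, 4)
```

A full transformer layer wraps this in learned query/key/value projections, multiple heads, residual connections, and an MLP, which the posts above walk through.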
Key language models
Tier 1
- Language Models are Unsupervised Multitask Learners (GPT-2)
- Language Models are Few-Shot Learners (GPT-3)
Tier 2
- ✨ LLaMA: Open and Efficient Foundation Language Models (LLaMA)
- ✨ Efficiently Modeling Long Sequences with Structured State Spaces (video) (S4)
- Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer (T5)
- Evaluating Large Language Models Trained on Code (OpenAI Codex)
- Training language models to follow instructions with human feedback (OpenAI Instruct)
Tier 3
- ✨ Mistral 7B (Mistral)
- ✨ Mixtral of Experts (Mixtral)
- ✨ Gemini: A Family of Highly Capable Multimodal Models (Gemini)
- ✨ Mamba: Linear-Time Sequence Modeling with Selective State Spaces (Mamba)
- Scaling Instruction-Finetuned Language Models (Flan)
Tier 4+
- ✨ Consistency Models
- ✨ Model Card and Evaluations for Claude Models (Claude 2)
- ✨ OLMo: Accelerating the Science of Language Models
- ✨ PaLM 2 Technical Report (Palm 2)
- ✨ Textbooks Are All You Need II: phi-1.5 technical report (phi 1.5)
- ✨ Visual Instruction Tuning (LLaVA)
- A General Language Assistant as a Laboratory for Alignment
- Finetuned Language Models Are Zero-Shot Learners (Google Instruct)
- Galactica: A Large Language Model for Science
- LaMDA: Language Models for Dialog Applications (Google Dialog)
- OPT: Open Pre-trained Transformer Language Models (Meta GPT-3)
- PaLM: Scaling Language Modeling with Pathways (PaLM)
- Program Synthesis with Large Language Models (Google Codex)
- Scaling Language Models: Methods, Analysis & Insights from Training Gopher (Gopher)
- Solving Quantitative Reasoning Problems with Language Models (Minerva)
- UL2: Unifying Language Learning Paradigms (UL2)
Training and fine-tuning
Tier 2
- ✨ Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
- Learning to summarize with human feedback
- Training Verifiers to Solve Math Word Problems
Tier 3
- ✨ Pretraining Language Models with Human Preferences
- ✨ Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
- Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning
- LoRA: Low-Rank Adaptation of Large Language Models
- Unsupervised Neural Machine Translation with Generative Language Models Only
Tier 4+
- ✨ Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
- ✨ Improving Code Generation by Training with Natural Language Feedback
- ✨ Language Modeling Is Compression
- ✨ LIMA: Less Is More for Alignment
- ✨ Learning to Compress Prompts with Gist Tokens
- ✨ Lost in the Middle: How Language Models Use Long Contexts
- ✨ QLoRA: Efficient Finetuning of Quantized LLMs
- ✨ Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
- ✨ Reinforced Self-Training (ReST) for Language Modeling
- ✨ Solving olympiad geometry without human demonstrations
- ✨ Tell, don't show: Declarative facts influence how LLMs generalize
- ✨ Textbooks Are All You Need
- ✨ TinyStories: How Small Can Language Models Be and Still Speak Coherent English?
- ✨ Training Language Models with Language Feedback at Scale
- ✨ Turing Complete Transformers: Two Transformers Are More Powerful Than One
- ByT5: Towards a token-free future with pre-trained byte-to-byte models
- Data Distributional Properties Drive Emergent In-Context Learning in Transformers
- Diffusion-LM Improves Controllable Text Generation
- ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
- Efficient Training of Language Models to Fill in the Middle
- ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning
- Prefix-Tuning: Optimizing Continuous Prompts for Generation
- Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning
- True Few-Shot Learning with Prompts -- A Real-World Perspective
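To make the parameter-efficient fine-tuning papers above (LoRA, QLoRA, prefix-tuning) concrete, here is a rough PyTorch sketch of the core LoRA idea: freeze a pretrained weight matrix and train only a low-rank update added to it. The class name and hyperparameters are illustrative, not taken from any paper's released code:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """A frozen linear layer plus a trainable low-rank update: y = W x + (alpha / r) * B A x."""

    def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # pretrained weights stay frozen

        in_f, out_f = base.in_features, base.out_features
        self.A = nn.Parameter(torch.randn(r, in_f) * 0.01)  # small random init
        self.B = nn.Parameter(torch.zeros(out_f, r))        # zero init: training starts at the base model
        self.scale = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

# Wrap an existing layer; only A and B receive gradients during fine-tuning.
layer = LoRALinear(nn.Linear(768, 768))
out = layer(torch.randn(2, 768))
print(out.shape)  # torch.Size([2, 768])
```

QLoRA applies the same idea on top of a quantized base model, so the frozen weights also take far less memory.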
ML in practice
Tier 1
- Machine Learning in Python: Main developments and technology trends in data science, machine learning, and AI
- Machine Learning: The High Interest Credit Card of Technical Debt
Tier 2