Skip to content

docs: add Troubleshooting section to README#186

Open
tykoo-chen wants to merge 1 commit intokarpathy:masterfrom
tykoo-chen:docs/troubleshooting
Open

docs: add Troubleshooting section to README#186
tykoo-chen wants to merge 1 commit intokarpathy:masterfrom
tykoo-chen:docs/troubleshooting

Conversation

@tykoo-chen
Copy link

Problem

New users often encounter common issues that are easy to solve but not documented.

Solution

Add a Troubleshooting section covering:

  • No CUDA GPU detected - points to MLX forks for Mac, CUDA drivers for others
  • CUDA out of memory - suggests reducing DEVICE_BATCH_SIZE and DEPTH
  • kernels/Flash Attention errors - suggests uv sync --reinstall
  • Loss not decreasing - explains warmup behavior
  • Script hangs at startup - explains torch.compile overhead

This should reduce issues/questions from new users.

Common issues and solutions for new users:
- No CUDA GPU detected
- CUDA out of memory
- kernels/Flash Attention errors
- Loss not decreasing
- Script hangs at startup
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant