My log book for my findings as I build and train different neural networks.

2kbits/ML-Experiments

ML-Experiments

About This Repository

This is my personal logbook for machine learning and AI experiments. As I build, train, and evaluate different neural network architectures, I document my findings, challenges, and breakthroughs here. This repository serves as both my learning journal and a showcase of my growing expertise in AI development.

Current Focus

I'm particularly interested in language models and exploring how to make them more efficient while maintaining performance. My experiments span various architectures and training approaches, with a focus on what can be accomplished with consumer-grade hardware.

Project Log

[02/26/2025] TinyStories Replication Experiment

  • Paper: "TinyStories: How Small Can Language Models Be and Still Speak Coherent English?"
  • Goal: Replicate the paper's approach using only my RTX 3090
  • Current Status: Reading the paper and planning implementation strategy
  • Technical Setup:
    • Hardware: RTX 3090 (24GB VRAM)
    • Framework: PyTorch
  • Key Questions:
    • How small can I make a coherent LLM with my hardware constraints?
    • What optimizations will be necessary for efficient training?
    • How will performance compare to the original paper's results?
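A back-of-the-envelope sketch for the first key question: estimate how many parameters a GPT-style decoder has and how much VRAM its training state would need. Everything here is an assumption for illustration, not the paper's setup or my final design: the function names are hypothetical, the layer layout (learned positional embeddings, biased linear layers, 4x MLP expansion, two LayerNorms per block) is one common convention, and the 16 bytes per parameter figure assumes fp32 weights, fp32 gradients, and the two Adam moment buffers. Activation memory is not counted, so real usage will be higher.

```python
def transformer_param_count(vocab_size, d_model, n_layers, context_len,
                            tie_embeddings=True):
    """Approximate parameter count for a GPT-style decoder (assumed layout)."""
    # Token embeddings plus learned positional embeddings.
    embed = vocab_size * d_model + context_len * d_model
    # Attention: Q, K, V, and output projections, each d_model x d_model with bias.
    attn = 4 * d_model * d_model + 4 * d_model
    # MLP: d_model -> 4*d_model -> d_model, with biases on both layers.
    mlp = 2 * d_model * (4 * d_model) + 4 * d_model + d_model
    # Two LayerNorms per block, each with a scale and a shift vector.
    norms = 2 * 2 * d_model
    block = attn + mlp + norms
    final_norm = 2 * d_model
    # An untied output head adds another vocab_size x d_model matrix.
    head = 0 if tie_embeddings else vocab_size * d_model
    return embed + n_layers * block + final_norm + head


def training_state_bytes(n_params, bytes_per_param=16):
    """fp32 weights (4) + fp32 grads (4) + Adam m and v (8); activations excluded."""
    return n_params * bytes_per_param


if __name__ == "__main__":
    # Hypothetical configuration, chosen only to exercise the functions.
    n = transformer_param_count(vocab_size=10_000, d_model=256,
                                n_layers=8, context_len=512)
    gib = training_state_bytes(n) / 2**30
    print(f"{n:,} parameters, about {gib:.2f} GiB of weights/grads/optimizer state")
```

Comparing the printed estimate against the 24GB budget gives a first filter on which model sizes are worth trying before measuring real memory use in PyTorch.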

Future Directions

  • Exploring quantization techniques
  • Implementing attention mechanism variations
  • Experimenting with different tokenization strategies
  • Testing alternative training datasets

Contact & Collaboration

I'm always open to collaboration, feedback, or discussions about ML/AI. Feel free to reach out if you have any questions about my experiments or want to collaborate on a project.


I actively maintain and update this repository as I continue my AI learning journey.
