-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Description
Goal
Build TF-IDF search for a small document corpus (product descriptions from FasterShops).
Learn
- Term frequency calculation
- Inverse document frequency and why it matters
- Sparse vector representation
- Cosine similarity for ranking
Deliverable
- Python notebook with implementation
- TypeScript implementation
- Comparison on sample queries
Proof Point
Can explain why "the" gets low weight and rare terms get high weight.
Directory
search-fundamentals/01-tfidf-from-scratch/
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels