If the markdown note references Tags or Keywords, then extract to use in tag counts. Account for how this may have been referenced in the past. Here are a few textual examples.
AIOSEO Settings:
- Focus keyword = client attraction
- Additional keywords = knowledge worker, organize information
WordPress Post metadata:
- Category = Podcast
- Tags = information marketing, lead generation, demand generation
HTML Metadata notes:
**Keywords:** sales lead generation, business to business sales lead,
customer acquisition
YouTube post sidecar:
Tags:
information marketing, lead generation, demand generation
Write a check function for each of these to find the pattern. Once the pattern is found, then hand it off to the ngram lists before compiling.
These keywords for clustering will help determine what is in the notes. Have a command line option to --metadata false to avoid drawing these into a list.