You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Mine a markdown corpus for articles of at least 300 words, mature enough for publishing. I have a lot of draft articles. It would need to parse YAML front matter to identify candidates for recycling, too.
Maturity would be determined by a readability index, looking only at the H1 and H2 as well as the body. Some drafts read better than others. Ignore long reports, try to fund unpublished drafts.
Once content is found, it is digested against a root keyword. Then a cluster of related documents was found. Go back into the markdown corpus to report supporting materials if desired. More of the same content that can be sourced.
The tool can work with a pre-determined root keyword or find content based on a certain length.
USER STORY: "User needs a 400-word article by 5 pm on business development, to go with a recent podcast episode. Run the tool against a directory of drafts, and it returns a short list of potential articles. Reporting file path, topic, scoring, word length, and keywords."
An option to print the article to standard output so that it can be ingested by something else. Make sure the command is stackable.