now that new features are detected, we need a new classifier to classify each section as boilerplate or content