You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Oct 24, 2022. It is now read-only.
It does not matter how well we tune the regular expressions the method will always be subject to false-positives. One effective way to reduce the noise is to use an ML model for filtering (not detection).
To build the ML model, the following steps are required:
Download a large body of content known to produce false-positives (js files and other source code).
Run the current set of detectors to extract leaks (the generic secrets set is most suitable).
Use brain.js or an equivalent framework to train a model to spot the false-positives.
Compile the model.
Use the model to filter results from problematic detectors (again, the generic secrets is most suitable).