A Scala / Java / Python library for anonymizing, encrypting, and redacting large datasets on Apache Spark.
It is currently maintained by a team of volunteers.
Post questions and comments to the Google group, or email them directly to data-commons-toolchain@googlegroups.com.
Docs are available at http://data-commons.github.io/protectr or check out the Scaladoc, Javadoc, or Python doc.
- Latest stable: 0.5. Coming soon!
- Coming Soon!