ARX is a comprehensive open source data anonymization tool aiming to provide scalability and usability.

ARX - Open Source Data Anonymization Software

Note

ARX is the result of a research project. To support our research, please cite one of our papers instead of referencing our website in scientific articles. You can find an overview of papers about ARX here. If you are not sure which paper to cite, we recommend this one:

Prasser F., Eicher J., Spengler H., Bild R., Kuhn K. A. (2020) Flexible Data Anonymization Using ARX — Current Status and Challenges Ahead. Software Pract Exper 2020;1–28. (Link)

Thanks!

Introduction

ARX is comprehensive open source software for anonymizing sensitive personal data. It has been designed from the ground up to provide high scalability, ease of use, and tight integration of the many different aspects relevant to data anonymization. Its highlights include:

Utility-focused anonymization using different statistical models
Syntactic privacy models, such as k-anonymity, ℓ-diversity, t-closeness and δ-presence
Semantic privacy models, such as (ε, δ)-differential privacy
Methods for optimizing the profitability of data publishing based on monetary cost-benefit analyses
Data transformation with generalization, suppression, microaggregation and top/bottom coding as well as global and local recoding
Methods for analyzing data utility
Methods for analyzing re-identification risks
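
To make two of the terms above concrete, here is a minimal, self-contained sketch of generalization and a k-anonymity check. It does not use the ARX API: the class `KAnonymitySketch`, its methods, and the sample records are all hypothetical and exist only for illustration. A dataset is k-anonymous if every combination of quasi-identifier values is shared by at least k records, and under the prosecutor attacker model the highest re-identification risk is 1 divided by the size of the smallest such group.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative sketch only: this class is hypothetical and does not use the ARX API.
public class KAnonymitySketch {

    // Generalizes an exact age to a ten-year interval, e.g. 34 becomes "30-39".
    public static String generalizeAge(int age) {
        int lower = (age / 10) * 10;
        return lower + "-" + (lower + 9);
    }

    // Keeps the first `keep` characters of a ZIP code and masks the rest with '*'.
    public static String generalizeZip(String zip, int keep) {
        StringBuilder masked = new StringBuilder(zip.substring(0, keep));
        for (int i = keep; i < zip.length(); i++) {
            masked.append('*');
        }
        return masked.toString();
    }

    // Returns the size of the smallest equivalence class, i.e. the smallest group of
    // rows sharing the same values in all quasi-identifier columns (0 if there are no rows).
    public static int smallestClassSize(List<String[]> rows, int[] quasiIdentifiers) {
        Map<List<String>, Integer> classSizes = new HashMap<>();
        for (String[] row : rows) {
            List<String> key = new ArrayList<>();
            for (int column : quasiIdentifiers) {
                key.add(row[column]);
            }
            classSizes.merge(key, 1, Integer::sum);
        }
        int smallest = rows.isEmpty() ? 0 : Integer.MAX_VALUE;
        for (int size : classSizes.values()) {
            smallest = Math.min(smallest, size);
        }
        return smallest;
    }

    // True if every equivalence class has at least k rows.
    // Assumption of this sketch: an empty dataset is reported as not k-anonymous.
    public static boolean isKAnonymous(List<String[]> rows, int[] quasiIdentifiers, int k) {
        return smallestClassSize(rows, quasiIdentifiers) >= k;
    }

    public static void main(String[] args) {
        // Made-up records: age, ZIP code, diagnosis. Age and ZIP code are the quasi-identifiers.
        List<String[]> raw = Arrays.asList(
            new String[] {"34", "81667", "flu"},
            new String[] {"36", "81675", "asthma"},
            new String[] {"45", "81925", "flu"},
            new String[] {"47", "81931", "diabetes"});
        int[] quasiIdentifiers = {0, 1};

        // Apply generalization to the quasi-identifiers; the sensitive value is kept as-is.
        List<String[]> generalized = new ArrayList<>();
        for (String[] row : raw) {
            generalized.add(new String[] {
                generalizeAge(Integer.parseInt(row[0])), generalizeZip(row[1], 3), row[2]});
        }

        System.out.println("raw is 2-anonymous: " + isKAnonymous(raw, quasiIdentifiers, 2));
        System.out.println("generalized is 2-anonymous: "
            + isKAnonymous(generalized, quasiIdentifiers, 2));
        System.out.println("highest prosecutor risk after generalization: "
            + 1.0 / smallestClassSize(generalized, quasiIdentifiers));
    }
}
```

With these sample records, the raw data is not 2-anonymous because every record is unique in its age and ZIP code, while the generalized data forms two groups of two records each, so it is 2-anonymous and the highest prosecutor risk is 0.5. The sketch applies one fixed generalization to the whole dataset and performs no search over alternatives, no suppression, and no utility measurement.
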

The software is able to handle very large datasets on commodity hardware and features an intuitive cross-platform graphical user interface. You can find further information on the project website.

Development setup

Currently, the main development of ARX is carried out using Eclipse as an IDE and Ant as a build tool. Support for other IDEs, such as IntelliJ IDEA, and for Maven as a build tool is experimental.

The Ant build script provides several targets for building different versions of ARX (e.g. with or without the GUI code). To build only the core code using Maven, set the system property core to true. This will build a platform-independent JAR containing the ARX main code module and no GUI components:

$ mvn compile -Dcore=true

Contributing and code of conduct

See contributing.md and code_of_conduct.md.

License

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

External Libraries

ARX uses external libraries. Their licenses are listed in the respective folders.
