Astronomy Extensions for Spark, or AXS, is a distributed framework for astronomical data processing based on Apache Spark. AXS provides simple Python API to enable fast cross-matching, querying and analysis of data from astronomical catalogs.
Before running AXS make sure you have Java v8 installed and JAVA_HOME variable set.
You will also need Python 3 with numpy, pandas and arrow packages.
To install AXS follow these steps (note that Anaconda installer is planned for the future):
- Download the latest AXS tarball from the realeases page.
- Unpack the tarball to a directory of your choosing.
- Set
SPARK_HOMEenvironment variable to point to the extraction directory. - Add
SPARK_HOME/binto yourPATHvariable. - Run the
axs-init-config.shscript to updatespark-defaults.confandhive-site.xmlfiles with the exactSPARK_HOMEpath.
And you're good to go!
Read more about starting and using AXS, and its architecture, in the documentation.
If you are using AXS in your scientific work, please cite this paper.