Skip to content

Latest commit

 

History

History
205 lines (156 loc) · 7.03 KB

install.md

File metadata and controls

205 lines (156 loc) · 7.03 KB
layout title description
default
Installation
Instructions for installing the latest release of Apache Arrow

Current Version: {{site.data.versions['current'].number}}

Released: {{site.data.versions['current'].date}}

See the release notes for more about what's new.

Source Release

Java Packages

Java Artifacts on Maven Central

Python Wheels

We have provided official binary wheels on PyPI for Linux, macOS, and Windows:

pip install pyarrow=={{site.data.versions['current'].pinned_number}}

We recommend pinning {{site.data.versions['current'].pinned_number}} in requirements.txt to install the latest patch release.

These include the Apache Arrow and Apache Parquet C++ binary libraries bundled with the wheel.

C++ and GLib (C) Packages for Debian GNU/Linux, Ubuntu and CentOS

We have provided APT and Yum repositories for Apache Arrow C++ and Apache Arrow GLib (C). Here are supported platforms:

  • Debian GNU/Linux stretch
  • Debian GNU/Linux buster
  • Ubuntu 16.04 LTS
  • Ubuntu 18.04 LTS
  • Ubuntu 19.10
  • CentOS 6
  • CentOS 7
  • CentOS 8
  • Amazon Linux 2

Debian GNU/Linux and Ubuntu:

sudo apt update
sudo apt install -y -V lsb-release wget
if [ $(lsb_release --codename --short) = "stretch" ]; then
  sudo tee /etc/apt/sources.list.d/backports.list <<APT_LINE
deb http://deb.debian.org/debian $(lsb_release --codename --short)-backports main
APT_LINE
fi
wget https://apache.bintray.com/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-archive-keyring-latest-$(lsb_release --codename --short).deb
sudo apt install -y -V ./apache-arrow-archive-keyring-latest-$(lsb_release --codename --short).deb
sudo apt update
sudo apt install -y -V libarrow-dev # For C++
sudo apt install -y -V libarrow-glib-dev # For GLib (C)
sudo apt install -y -V libarrow-flight-dev # For Flight C++
sudo apt install -y -V libplasma-dev # For Plasma C++
sudo apt install -y -V libplasma-glib-dev # For Plasma GLib (C)
sudo apt install -y -V libgandiva-dev # For Gandiva C++
sudo apt install -y -V libgandiva-glib-dev # For Gandiva GLib (C)
sudo apt install -y -V libparquet-dev # For Apache Parquet C++
sudo apt install -y -V libparquet-glib-dev # For Apache Parquet GLib (C)

CentOS 8:

sudo dnf install -y https://apache.bintray.com/arrow/centos/$(cut -d: -f5 /etc/system-release-cpe)/apache-arrow-release-latest.rpm
sudo dnf install -y --enablerepo=epel --enablerepo=PowerTools arrow-devel # For C++
sudo dnf install -y --enablerepo=epel --enablerepo=PowerTools arrow-glib-devel # For GLib (C)
sudo dnf install -y --enablerepo=epel --enablerepo=PowerTools parquet-devel # For Apache Parquet C++
sudo dnf install -y --enablerepo=epel --enablerepo=PowerTools parquet-glib-devel # For Parquet GLib (C)

CentOS 6 and 7:

sudo yum install -y https://apache.bintray.com/arrow/centos/$(cut -d: -f5 /etc/system-release-cpe)/apache-arrow-release-latest.rpm
sudo yum install -y --enablerepo=epel arrow-devel # For C++
sudo yum install -y --enablerepo=epel arrow-glib-devel # For GLib (C)
sudo yum install -y --enablerepo=epel parquet-devel # For Apache Parquet C++
sudo yum install -y --enablerepo=epel parquet-glib-devel # For Parquet GLib (C)

Amazon Linux:

sudo yum install -y https://dl.fedoraproject.org/pub/epel/epel-release-latest-7.noarch.rpm
sudo yum install -y https://apache.bintray.com/arrow/centos/7/apache-arrow-release-latest.rpm
sudo yum install -y --enablerepo=epel arrow-devel # For C++
sudo yum install -y --enablerepo=epel arrow-glib-devel # For GLib (C)
sudo yum install -y --enablerepo=epel parquet-devel # For Apache Parquet C++
sudo yum install -y --enablerepo=epel parquet-glib-devel # For Parquet GLib (C)

Other Installers

For convenience, we also provide packages through several package managers. Many of them are provided as binary, built from the source release. As the Apache Arrow PMC has not explicitly voted on these packages, they are technically considered unofficial releases.

C++ and Python Conda Packages

Binary conda packages are on conda-forge for the following platforms:

  • Linux and macOS (Python 2.7, 3.6 and 3.7)
  • Windows (Python 3.6 and 3.7)

Install them with:

conda install arrow-cpp={{site.data.versions['current'].pinned_number}} -c conda-forge
conda install pyarrow={{site.data.versions['current'].pinned_number}} -c conda-forge

C++ and GLib (C) Packages on Homebrew

On macOS, you can install the C++ library using Homebrew:

brew install apache-arrow

and GLib (C) package with:

brew install apache-arrow-glib

C++ and GLib (C) Packages for MSYS2

The MSYS2 packages include Apache Arrow C++ and GLib (C) package. You can install the package by pacman.

64bit version:

pacman -S --noconfirm mingw-w64-x86_64-arrow

32bit version:

pacman -S --noconfirm mingw-w64-i686-arrow

C++ Package on vcpkg

You can download and install Apache Arrow C++ using the vcpkg dependency manager:

git clone https://github.com/Microsoft/vcpkg.git
cd vcpkg
./bootstrap-vcpkg.sh
./vcpkg integrate install
./vcpkg install arrow

The Apache Arrow C++ port in vcpkg is kept up to date by Microsoft team members and community contributors. If the version is out of date, please create an issue or pull request on the vcpkg repository.