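Installing Apache Arrow
Current version: 19.0.0 (2025-01-16)
See the [release notes][10] for more about what's new. For information on previous releases, see [here][19]. The Rust and Julia libraries are released separately. See the following pages for details:
- Rust: [documentation for the arrow crate][26]
- Julia: [repository of the Arrow.jl package][27]
This page is a reference listing of release artifacts and package managers. For language-specific user guides, see the pages listed in the "Documentation" menu above.
Source Release
- Source release: [apache-arrow-19.0.0.tar.gz][6]
- Verification: [asc signature][13], [sha256 checksum][14], [sha512 checksum][15] ([verification instructions][12])
- Git tag: a999eaccb12378f9e4e9ab758f18edc25b0991e5
- [GPG keys for verifying release signatures][11]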
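The checksum part of the verification step can be sketched in Python as follows. This is a minimal illustration, not the official procedure described in the verification instructions. The helper names file_digest and matches_checksum_file are hypothetical, and the sketch assumes the checksum file begins with the hex digest, optionally followed by a file name. It does not check the GPG signature.

```python
import hashlib


def file_digest(path, algorithm="sha256", chunk_size=1 << 20):
    """Return the hex digest of the file at `path`, read in chunks."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as f:
        # Read fixed-size chunks so large tarballs are not loaded at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum_file(path, checksum_text, algorithm="sha256"):
    """Compare a file's digest with the first token of a checksum file.

    Assumption: the checksum text looks like "<hex digest>  <file name>".
    """
    tokens = checksum_text.split()
    if not tokens:
        return False
    return file_digest(path, algorithm) == tokens[0].lower()
```

With the tarball and its .sha256 file downloaded side by side, one might call matches_checksum_file("apache-arrow-19.0.0.tar.gz", open("apache-arrow-19.0.0.tar.gz.sha256").read()); passing algorithm="sha512" covers the sha512 file in the same way.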
Java Packages
[Java artifacts on Maven Central][4]
Python Wheels
We have provided official binary wheels on PyPI for Linux, macOS, and Windows:
pip install pyarrow==19.0.*
We recommend pinning the version to 19.0.* in requirements.txt to install the latest patch release.
These wheels include the Apache Arrow and Apache Parquet C++ binary libraries.
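To illustrate what the 19.0.* pin accepts, here is a small sketch that checks a version string against a trailing-wildcard pin and reads the installed pyarrow version, if any, from package metadata. The function names are hypothetical, and matches_pin is a simplified stand-in for pip's real version matching (PEP 440), not a reimplementation of it.

```python
from importlib import metadata


def matches_pin(version, pin="19.0.*"):
    """Return True if `version` satisfies a simple trailing-wildcard pin.

    Simplification: only the dot-separated prefix before "*" is compared.
    """
    pin_parts = pin.split(".")
    if pin_parts[-1] != "*":
        return version == pin
    prefix = pin_parts[:-1]
    return version.split(".")[: len(prefix)] == prefix


def installed_pyarrow_version():
    """Return the installed pyarrow version string, or None if absent."""
    try:
        return metadata.version("pyarrow")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_pyarrow_version()
    if found is None:
        print("pyarrow is not installed")
    else:
        print(found, "matches 19.0.*:", matches_pin(found))
```

Under this sketch, 19.0.0 and 19.0.1 match the pin while 19.1.0 does not, which is the behavior the recommendation relies on: patch releases are picked up, minor releases are not.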
Go Modules
Go modules are tagged with their versions, so they can be installed easily with go get:
go get github.com/apache/arrow/go/[email protected]
The Apache Arrow module can then be imported with:
import "github.com/apache/arrow/go/v/arrow"
C++ and GLib (C) Packages for Debian GNU/Linux, Ubuntu, AlmaLinux, CentOS, and Amazon Linux
We have provided APT and Yum repositories for Apache Arrow C++ and Apache Arrow GLib (C). The supported platforms are:
- Debian GNU/Linux bullseye
- Debian GNU/Linux bookworm
- Debian GNU/Linux trixie
- Ubuntu 20.04 LTS
- Ubuntu 22.04 LTS
- AlmaLinux 8
- AlmaLinux 9
- CentOS 7
- CentOS Stream 8
- CentOS Stream 9
- Red Hat Enterprise Linux 7
- Red Hat Enterprise Linux 8
- Red Hat Enterprise Linux 9
- Amazon Linux 2023
- Oracle Linux 8
- Oracle Linux 9
Debian GNU/Linux and Ubuntu
sudo apt update
sudo apt install -y -V ca-certificates lsb-release wget
wget https://apache.jfrog.io/artifactory/arrow/$(lsb_release --id --short | tr 'A-Z' 'a-z')/apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb
sudo apt install -y -V ./apache-arrow-apt-source-latest-$(lsb_release --codename --short).deb
sudo apt update
sudo apt install -y -V libarrow-dev # For C++
sudo apt install -y -V libarrow-glib-dev # For GLib (C)
sudo apt install -y -V libarrow-dataset-dev # For Apache Arrow Dataset C++
sudo apt install -y -V libarrow-dataset-glib-dev # For Apache Arrow Dataset GLib (C)
sudo apt install -y -V libarrow-acero-dev # For Apache Arrow Acero
sudo apt install -y -V libarrow-flight-dev # For Apache Arrow Flight C++
sudo apt install -y -V libarrow-flight-glib-dev # For Apache Arrow Flight GLib (C)
sudo apt install -y -V libarrow-flight-sql-dev # For Apache Arrow Flight SQL C++
sudo apt install -y -V libarrow-flight-sql-glib-dev # For Apache Arrow Flight SQL GLib (C)
sudo apt install -y -V libgandiva-dev # For Gandiva C++
sudo apt install -y -V libgandiva-glib-dev # For Gandiva GLib (C)
sudo apt install -y -V libparquet-dev # For Apache Parquet C++
sudo apt install -y -V libparquet-glib-dev # For Apache Parquet GLib (C)
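The wget command above builds the package URL from two lsb_release outputs: the distributor ID, lowercased, and the release codename. The sketch below mirrors that string construction so the resulting URL is easier to see. The function name apt_source_url is hypothetical, and the inputs in the usage lines are illustrative values, not output captured from a real system.

```python
BASE_URL = "https://apache.jfrog.io/artifactory/arrow"


def apt_source_url(distributor_id, codename):
    """Build the apt-source .deb URL the way the shell command does.

    `distributor_id` stands in for `lsb_release --id --short` and
    `codename` for `lsb_release --codename --short`.
    """
    # tr 'A-Z' 'a-z' lowercases the distributor ID, e.g. "Debian" -> "debian".
    distro = distributor_id.strip().lower()
    name = codename.strip()
    return f"{BASE_URL}/{distro}/apache-arrow-apt-source-latest-{name}.deb"


if __name__ == "__main__":
    print(apt_source_url("Debian", "bookworm"))
    print(apt_source_url("Ubuntu", "jammy"))
```

For Debian bookworm this yields https://apache.jfrog.io/artifactory/arrow/debian/apache-arrow-apt-source-latest-bookworm.deb, which is the file the subsequent apt install line installs from the current directory.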
AlmaLinux 8/9, Oracle Linux 8/9, Red Hat Enterprise Linux 8/9, and CentOS Stream 8/9
sudo dnf install -y epel-release || sudo dnf install -y oracle-epel-release-el$(cut -d: -f5 /etc/system-release-cpe | cut -d. -f1) || sudo dnf install -y https://dl.fedoraproject.org/pub/epel/epel-release-latest-$(cut -d: -f5 /etc/system-release-cpe | cut -d. -f1).noarch.rpm
sudo dnf install -y https://apache.jfrog.io/artifactory/arrow/almalinux/$(cut -d: -f5 /etc/system-release-cpe | cut -d. -f1)/apache-arrow-release-latest.rpm
sudo dnf config-manager --set-enabled epel || :
sudo dnf config-manager --set-enabled powertools || :
sudo dnf config-manager --set-enabled crb || :
sudo dnf config-manager --set-enabled ol$(cut -d: -f5 /etc/system-release-cpe | cut -d. -f1)_codeready_builder || :
sudo dnf config-manager --set-enabled codeready-builder-for-rhel-$(cut -d: -f5 /etc/system-release-cpe | cut -d. -f1)-rhui-rpms || :
sudo subscription-manager repos --enable codeready-builder-for-rhel-$(cut -d: -f5 /etc/system-release-cpe | cut -d. -f1)-$(arch)-rpms || :
sudo dnf install -y arrow-devel # For C++
sudo dnf install -y arrow-glib-devel # For GLib (C)
sudo dnf install -y arrow-dataset-devel # For Apache Arrow Dataset C++
sudo dnf install -y arrow-dataset-glib-devel # For Apache Arrow Dataset GLib (C)
sudo dnf install -y arrow-acero-devel # For Apache Arrow Acero C++
sudo dnf install -y arrow-flight-devel # For Apache Arrow Flight C++
sudo dnf install -y arrow-flight-glib-devel # For Apache Arrow Flight GLib (C)
sudo dnf install -y arrow-flight-sql-devel # For Apache Arrow Flight SQL C++
sudo dnf install -y arrow-flight-sql-glib-devel # For Apache Arrow Flight SQL GLib (C)
sudo dnf install -y gandiva-devel # For Apache Gandiva C++
sudo dnf install -y gandiva-glib-devel # For Apache Gandiva GLib (C)
sudo dnf install -y parquet-devel # For Apache Parquet C++
sudo dnf install -y parquet-glib-devel # For Apache Parquet GLib (C)
CentOS 7 and Red Hat Enterprise Linux 7
sudo yum install -y epel-release || sudo yum install -y https://dl.fedoraproject.org/pub/epel/epel-release-latest-$(cut -d: -f5 /etc/system-release-cpe | cut -d. -f1).noarch.rpm
sudo yum install -y https://apache.jfrog.io/artifactory/arrow/centos/$(cut -d: -f5 /etc/system-release-cpe | cut -d. -f1)/apache-arrow-release-latest.rpm
sudo yum install -y --enablerepo=epel arrow-devel # For C++
sudo yum install -y --enablerepo=epel arrow-glib-devel # For GLib (C)
sudo yum install -y --enablerepo=epel arrow-dataset-devel # For Apache Arrow Dataset C++
sudo yum install -y --enablerepo=epel arrow-dataset-glib-devel # For Apache Arrow Dataset GLib (C)
sudo yum install -y --enablerepo=epel arrow-acero-devel # For Apache Arrow Acero
sudo yum install -y --enablerepo=epel parquet-devel # For Apache Parquet C++
sudo yum install -y --enablerepo=epel parquet-glib-devel # For Apache Parquet GLib (C)
Amazon Linux 2023
sudo dnf install -y https://apache.jfrog.io/artifactory/arrow/amazon-linux/$(cut -d: -f6 /etc/system-release-cpe)/apache-arrow-release-latest.rpm
sudo dnf install -y arrow-devel # For C++
sudo dnf install -y arrow-glib-devel # For GLib (C)
sudo dnf install -y arrow-acero-devel # For Apache Arrow Acero
sudo dnf install -y arrow-dataset-devel # For Apache Arrow Dataset C++
sudo dnf install -y arrow-dataset-glib-devel # For Apache Arrow Dataset GLib (C)
sudo dnf install -y arrow-flight-devel # For Apache Arrow Flight C++
sudo dnf install -y arrow-flight-glib-devel # For Apache Arrow Flight GLib (C)
sudo dnf install -y arrow-flight-sql-devel # For Apache Arrow Flight SQL C++
sudo dnf install -y arrow-flight-sql-glib-devel # For Apache Arrow Flight SQL GLib (C)
sudo dnf install -y gandiva-devel # For Apache Gandiva C++
sudo dnf install -y gandiva-glib-devel # For Apache Gandiva GLib (C)
sudo dnf install -y parquet-devel # For Apache Parquet C++
sudo dnf install -y parquet-glib-devel # For Apache Parquet GLib (C)
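The RPM-based sections above all read the OS version from /etc/system-release-cpe: the AlmaLinux and CentOS commands take colon-separated field 5 and keep the part before the first dot, while the Amazon Linux command takes field 6 unchanged. The following sketch approximates those cut pipelines. The helper names are hypothetical, and the sample CPE strings are assumed, illustrative values; the contents of the file on a given system may differ.

```python
ARTIFACTORY = "https://apache.jfrog.io/artifactory/arrow"


def cpe_field(cpe, field):
    """Return the 1-based colon-separated field, like `cut -d: -f<field>`.

    A line with fewer fields yields an empty string.
    """
    parts = cpe.strip().split(":")
    return parts[field - 1] if field <= len(parts) else ""


def major_version(cpe, field=5):
    """Keep the text before the first dot, like `cut -d. -f1`."""
    return cpe_field(cpe, field).split(".")[0]


def release_rpm_url(distro_dir, version):
    """Build the apache-arrow-release-latest.rpm URL used above."""
    return f"{ARTIFACTORY}/{distro_dir}/{version}/apache-arrow-release-latest.rpm"


if __name__ == "__main__":
    # Assumed sample values, for illustration only.
    alma = "cpe:/o:almalinux:almalinux:9::baseos"
    amazon = "cpe:2.3:o:amazon:amazon_linux:2023"
    print(release_rpm_url("almalinux", major_version(alma)))
    print(release_rpm_url("amazon-linux", cpe_field(amazon, 6)))
```

The different field index reflects the extra "2.3" component in the assumed Amazon Linux string, which shifts the version one position to the right.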
C# Packages
We have provided NuGet packages for Apache Arrow C#:
- [Apache.Arrow][22]
- [Apache.Arrow.Flight][23]
- [Apache.Arrow.Flight.AspNetCore][24]
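Other Installers
For convenience, we also provide packages through several package managers. Many of them are provided as binaries built from the source release. Because the Apache Arrow PMC has not explicitly voted on these packages, they are technically considered unofficial releases.
C++ and Python Conda Packages
Binary conda packages are available on [conda-forge][5] for Linux (x86_64, aarch64, ppc64le), macOS (x86_64 and arm64), and Windows (x86_64), for the following versions: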
- Python 3.8, 3.9, 3.10, 3.11
- R 4.1, 4.2, 4.3
Install them with:
conda install arrow-cpp=19.0.* -c conda-forge
conda install pyarrow=19.0.* -c conda-forge
conda install r-arrow=19.0.* -c conda-forge
C++ and GLib (C) Packages on Homebrew
On macOS, you can install the C++ library with [Homebrew][17]:
brew install apache-arrow
and the GLib (C) package with:
brew install apache-arrow-glib
C++ and GLib (C) Packages for MSYS2
The MSYS2 packages include the [Apache Arrow C++ and GLib (C) package][16]. You can install it with pacman.
64-bit UCRT version:
pacman -S --noconfirm mingw-w64-ucrt-x86_64-arrow
64-bit version:
pacman -S --noconfirm mingw-w64-x86_64-arrow
32-bit version:
pacman -S --noconfirm mingw-w64-i686-arrow
C++ Package on vcpkg
You can download and install Apache Arrow C++ with the vcpkg dependency manager:
git clone https://github.com/Microsoft/vcpkg.git
cd vcpkg
./bootstrap-vcpkg.sh
./vcpkg integrate install
./vcpkg install arrow
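The Apache Arrow C++ port in vcpkg is maintained by Microsoft team members and community contributors. If the version is out of date, please [create an issue or pull request][18] on the vcpkg repository.
R Package on CRAN
Install the R package from [CRAN][20] with: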
install.packages("arrow")
Ruby Packages on RubyGems
Install the Ruby packages for Ruby 3.0, 3.1, and 3.2 from [RubyGems][25] with:
gem install red-arrow
gem install red-arrow-cuda # For CUDA support
gem install red-arrow-dataset # For Apache Arrow Dataset support
gem install red-arrow-flight # For Apache Arrow Flight support
gem install red-arrow-flight-sql # For Apache Arrow Flight SQL support
gem install red-gandiva # For Gandiva support
gem install red-parquet # For Apache Parquet support
[4]:
[5]: https://conda-forge.github.io
[6]: https://apache.org/dyn/closer.lua?action=download&filename=arrow/arrow-19.0.0/apache-arrow-19.0.0.tar.gz
[10]: /release/19.0.0.html
[11]: https://downloads.apache.org/arrow/KEYS
[12]: https://apache.org/dyn/closer.cgi#verify
[13]: https://downloads.apache.org/arrow/arrow-19.0.0/apache-arrow-19.0.0.tar.gz.asc
[14]: https://downloads.apache.org/arrow/arrow-19.0.0/apache-arrow-19.0.0.tar.gz.sha256
[15]: https://downloads.apache.org/arrow/arrow-19.0.0/apache-arrow-19.0.0.tar.gz.sha512
[16]: https://github.com/msys2/MINGW-packages/tree/HEAD/mingw-w64-arrow
[17]: https://brew.sh.cn/
[18]: https://github.com/Microsoft/vcpkg
[19]: /release/
[20]: https://cran.r-project.org.cn/
[22]: https://nuget.net.cn/packages/Apache.Arrow/
[23]: https://nuget.net.cn/packages/Apache.Arrow.Flight/
[24]: https://nuget.net.cn/packages/Apache.Arrow.Flight.AspNetCore/
[25]: https://rubygems.org.cn/
[26]: https://docs.rs/crate/arrow/latest
[27]: https://github.com/apache/arrow-julia/#readme