arrow package for R provides access to Apache Arrow’s C++ library, enabling fast data processing, dplyr integration, and support for various data formats including Parquet and Arrow IPC.
Quick Install
Install the latest release from CRAN:Installation Methods
From CRAN (Recommended)
The simplest method for most users:Install from CRAN
From R-universe
R-universe provides pre-compiled binaries for the most common operating systems:Linux users: Consult the R-universe documentation for guidance on binary availability and repository configuration for your distribution.
From conda-forge
If you’re using conda for environment management:--strict-channel-priority flag ensures all dependencies come from conda-forge for consistency.
Platform-Specific Considerations
macOS
Linux
On Linux, CRAN does not provide pre-compiled binaries, so the package is built from source. The installation process attempts to automatically handle dependencies.For detailed Linux installation guidance, including distribution-specific instructions, see the official installation guide.
Ubuntu/Debian
For a smoother installation experience, you can pre-install system dependencies:RHEL/CentOS/Rocky Linux
Windows
Windows users can simply install from CRAN:Compiler Requirement: As of arrow 23.0.0, building from source requires C++20 support. On Windows, this means R 4.3 or later is required. R 4.2 has incomplete C++20 support and may work with special configuration.
Compiler Requirements
Verifying Your Installation
After installation, verify that arrow is working correctly:Check Arrow info
- Arrow version
- Available features (Parquet, Dataset, S3, etc.)
- Build configuration
Common Use Cases
Reading and Writing Parquet Files
Working with Large Datasets
Using dplyr with Arrow
Reading from CSV
Installing Nightly Builds
For the latest development features:Troubleshooting
Installation fails on Linux
Installation fails on Linux
Ensure system dependencies are installed:Then retry the installation in R. See the installation details for more help.
Segfaults on Apple Silicon Mac
Segfaults on Apple Silicon Mac
This usually indicates an architecture mismatch:
- Check your R architecture:
R.version$arch - Should be
aarch64for M1/M2/M3 Macs - If showing
x86_64, download ARM64 R from CRAN - Reinstall arrow after switching R versions
Features missing (e.g., no S3 support)
Features missing (e.g., no S3 support)
Check which features are available:If features are missing:
- Try installing from R-universe or conda-forge instead of CRAN
- CRAN builds include S3 but not GCS support
- See the cloud storage article for details
C++20 compiler errors on Windows
C++20 compiler errors on Windows
You need R 4.3 or later:
- Update R to version 4.3+
- Or install pre-built binaries from R-universe
- See installation details
Additional Resources
Arrow for R Documentation
Complete R package documentation
R Cookbook
Practical recipes for common tasks
Scaling Up with R and Arrow
Free online book
Cheatsheet
Quick reference guide
Next Steps
Now that you have arrow installed:- Learn about reading and writing files
- Explore data wrangling with dplyr
- Check out supported dplyr functions
- Try the R4DS Arrow chapter