Analyzing datasets that exceed your system’s memory can be a significant challenge, but the right tools can make it manageable. In this session, we'll explore how to use Apache Arrow—a high-performance, multi-language framework for working with larger-than-memory tabular data—together with DuckDB, a fast and lightweight embedded database system. In this webinar we’ll guide you through combining these powerful tools to build efficient, scalable data analysis pipelines directly in R. This webinar will equip you with practical strategies to overcome memory limitations and enhance your data processing capabilities in R.
Speaker: Pete Lawson
** This session is part of Love Data Week 2025. To attend this session, first register here: https://bit.ly/JH_lovedataweek and then follow the instruction under "Registration and Creating an Itinerary" in the description to add this session to your itinerary. **