Course Outline

Introduction

  • Apache Arrow vs Parquet

Installing and Configuring Apache Arrow

Overview of Apache Arrow Features and Architecture

Exploring Data with Pandas and Apache Arrow

Exploring Data with Spark and Apache Arrow

Exploring Data with R and Apache Arrow

Exploring Data with MapD and Apache Arrow

Other Data Analysis Integrations

  • PySpark, Parquet files on S3, Oracle tables, and Elasticsearch indices

Troubleshooting

Summary and Conclusion

Requirements

  • A basic understanding of SQL
  • Familiarity with Python or R
  • Some familiarity with Apache Spark

Duration

  • 14 hours
