A data engineering project that turns personal Netflix viewing data into actionable insights by combining Netflix's export data with the TMDB API and implementing a complete ETL pipeline using GCP, Airflow, and Tableau.
The project demonstrates how to enrich basic streaming data with external sources to uncover viewing patterns and preferences.
Reasons to Read -- Learn:
how to build an end-to-end data pipeline on GCP services, from raw data storage to a data warehouse, with practical examples of bucket organization and schema design.
how to enrich basic Netflix viewing data through TMDB API integration, turning limited source data (just titles and dates) into a dataset rich enough for analytics.
how to orchestrate the pipeline with Airflow, with specific examples of how DAGs are organized for extract-load and transform operations in a real-world scenario.
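To make the enrichment idea concrete, here is a minimal Python sketch, not the author's implementation. It assumes the Netflix export is a CSV with `Title` and `Date` columns and that lookups go through TMDB's `/3/search/multi` endpoint with an API key passed as a query parameter. The helper names (`base_title`, `build_search_url`, `enrich`) and the season-marker heuristic are hypothetical and chosen only for illustration.

```python
import csv
import io
import re
from urllib.parse import urlencode

# TMDB v3 multi-search endpoint (searches movies and TV shows together).
TMDB_SEARCH_URL = "https://api.themoviedb.org/3/search/multi"

# Heuristic (an assumption): episode rows look like "Show: Season 1: Episode".
_SEASON_MARKER = re.compile(
    r"^(?P<show>.+?): (?:Season|Part|Volume|Limited Series)\b"
)


def parse_history(csv_text):
    """Parse the viewing-history CSV text into a list of row dicts."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def base_title(raw_title):
    """Strip season/episode suffixes so the title can be searched on TMDB."""
    match = _SEASON_MARKER.match(raw_title)
    return match.group("show") if match else raw_title


def build_search_url(title, api_key):
    """Build the search URL for one title; the caller performs the request."""
    return f"{TMDB_SEARCH_URL}?{urlencode({'query': title, 'api_key': api_key})}"


def enrich(rows, search):
    """Attach TMDB metadata to each viewing row.

    `search` is any callable mapping a title to the best TMDB result
    (a dict) or None; results are cached so each title is looked up once.
    """
    cache = {}
    for row in rows:
        title = base_title(row["Title"])
        if title not in cache:
            cache[title] = search(title)
        hit = cache[title] or {}
        yield {
            "title": title,
            "date": row["Date"],
            "tmdb_id": hit.get("id"),
            "media_type": hit.get("media_type"),
        }
```

Passing the lookup in as a `search` callable keeps the network call out of the transformation logic, so the same function can be run against a stub in tests and against a real HTTP client inside a pipeline task.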
4 min read | author: Fabio Barbazza