Your marketing team wants to use product reviews data to gain insight on which products are liked by customers by state in the category of “Home and Grocery”. This will enable the business to plan new product offerings. The business users want to generate these reports more frequently and with the performance SLA of completing these reports in seconds. They also want to integrate the analyzed data back to the datalake on Amazon S3 to be used by various analytical applications.
To meet the business needs the data engineering team has come up with the following data model.
For this lab we will leverage the following datasets.
customer and customer_address tables are already created and loaded with data.