Neil Nazareth

Detail-oriented Data Analyst, skilled in SQL, Python and PowerBI

Movie Data Correlation & Insights
Python

In this project I worked with Python to explore variables and understand how various features for movies correlate with eachother.

The raw data was obtained through a public dataset available on Kaggle.

The raw data consisted of Movie data with fields like Revenue, Budget, Production Company, Runtime, Director, etc.

Initially, I assumed that a movie's gross revenue was strongly associated with a few primary factors such as the budget and production company. However, upon examining a correlation matrix of all variables, I discovered that although a film's budget does influence its revenue, the production company does not seem to be as significant. What I overlooked in my original assumption was that the number of votes appears to have a much more substantial impact on gross revenue.