Skip to content

Explore Netflix's World ๐ŸŒ๐Ÿฟ. An in-depth analysis of Netflix's vast content library! Dive into the data behind over 10,000 movies and TV shows. Discover trends, genres, top creators, and more with Python and data visualizations. Uncover the magic of Netflix data!

License

Notifications You must be signed in to change notification settings

eshayalagi/Netflix_data_sci

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

4 Commits
ย 
ย 
ย 
ย 

Repository files navigation

๐ŸŽฌ "Netflix EDA with Content-Based Recommendationsโ€

Netflix Data Analysis & Recommendation System with Python ๐Ÿ“Š๐Ÿ”

Dive into the world of Netflix with Python! Analyze over 10,000+ titles and uncover trends, visual stories, and content similarities from your favorite streaming giant using data science and machine learning.


๐Ÿ“Œ Overview

Netflix, one of the worldโ€™s leading video streaming platforms, has a massive global library of movies ๐ŸŽฅ and TV shows ๐Ÿ“บ, serving 222M+ subscribers (as of mid-2021). This project delivers a complete exploratory data analysis (EDA) of Netflix content and a content-based recommendation engine using Python.

You'll discover insights about genres, ratings, countries, time trends, and even get recommendations for what to watch next โ€” all using real-world Netflix data.


๐Ÿ” Project Highlights

๐Ÿงฐ Libraries Used

  • Pandas ๐Ÿผ โ€“ data manipulation
  • NumPy ๐Ÿงฎ โ€“ numerical operations
  • Matplotlib ๐Ÿ“Š โ€“ visualizations
  • Seaborn ๐Ÿ“ˆ โ€“ statistical plotting
  • WordCloud โ˜๏ธ โ€“ generate textual data clouds
  • Scikit-learn ๐Ÿค– โ€“ ML & similarity computation

๐Ÿ“Š Dataset Columns

Column Description
show_id Unique identifier for each title
type Movie or TV Show
title Title of the content
director Directorโ€™s name
cast Main cast members
country Country of production
date_added Date added to Netflix
release_year Year of original release
rating Maturity rating (e.g., TV-MA, PG)
duration Duration in minutes or number of seasons
listed_in Genre(s) of the content
description Short summary or synopsis

๐Ÿ“ˆ Key Insights & Visualizations

  • ๐ŸŒ Country-wise Rating Distribution
  • ๐ŸŽญ Genre Trends by Country
  • ๐Ÿ”ž Genre vs. Rating Matrix
  • ๐Ÿ“Š Correlation Heatmaps
  • ๐ŸŽค Most Active Actors
  • โฑ๏ธ Content Duration Analysis
  • ๐Ÿ“… Time Trends in Content Addition
  • ๐Ÿ‘ถ Age Group Classification
  • ๐ŸŽฌ Top Genres & Directors
  • ๐ŸŒŽ Global Content Distribution
  • ๐Ÿฟ Movie vs. TV Show Breakdown

๐Ÿค– NEW: Content-Based Recommendation Engine

A lightweight machine learning engine recommends similar titles based on the title, genre, and description of Netflix content.

recommend("Stranger Things", n=5)

๐ŸŽฏ Suggests similar shows by comparing textual similarity using TF-IDF and cosine similarity. Great for demo, search enhancement, and personalized recommendations.


๐ŸŽฏ Goals of the Project

  • Deliver intuitive, visual summaries of Netflixโ€™s library
  • Practice real-world EDA and text-based ML techniques
  • Analyze how genres, countries, and ratings shape content strategy
  • Explore growth and evolution of Netflix over the years
  • Introduce a content recommender for viewers and analysts

๐ŸŒŸ Features

โœ… Cleaned and preprocessed dataset โœ… Beautiful plots with Seaborn and Matplotlib โœ… Word clouds for cast, genres, and descriptions โœ… Year-wise and country-wise analysis โœ… TF-IDFโ€“based recommendation system โœ… Easy-to-use recommend("Title") function


๐Ÿ”ฎ Future Enhancements

  • ๐Ÿ“Š Add interactive dashboards (Streamlit / Plotly Dash)
  • ๐ŸŽฅ Sentiment analysis on content descriptions
  • ๐Ÿง  Expand to collaborative filtering
  • ๐Ÿ”„ Integrate with Netflix API or scrape for live updates
  • ๐Ÿ” Compare with Prime Video, Hulu, Disney+

๐Ÿค Contributors

  • Esha Yalagi
  • Mudabbir

๐Ÿง  Topics & Skills

#ExploratoryDataAnalysis #DataVisualization #MachineLearning #ContentRecommendation #Netflix #Python #Pandas #Seaborn #ScikitLearn #WordCloud


๐Ÿ“ฃ About

Explore Netflixโ€™s World ๐ŸŒ๐Ÿฟ An engaging, visual deep-dive into Netflixโ€™s content library using Python. Whether you're a data enthusiast, student, or Netflix binge-watcher, this project helps you uncover the data science behind content trends and intelligent recommendations.


About

Explore Netflix's World ๐ŸŒ๐Ÿฟ. An in-depth analysis of Netflix's vast content library! Dive into the data behind over 10,000 movies and TV shows. Discover trends, genres, top creators, and more with Python and data visualizations. Uncover the magic of Netflix data!

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published