SlideShare a Scribd company logo
4
Most read
12
Most read
21
Most read
DEPARTMENT OF COMPUTER SCIENCE,
UNIVERSITY OF DELHI
Movie Recommendation
System
Dr. Bharti Rana
Submitted To:
Uddeshy Kr. Bhagat(56)
Pooja Sahu(33)
MCA Semester 5
Submitted By:
About the Project
• Introduction
• Statement of the Problem
• Objective
• Related Work
• Methodology
• Summary and Conclusion
• A recommendation system is any system that automatically suggests content, products
or serivces which should interest customers based on their preferences
What is recommedation system?
• Help to increase the site’s page views, dwell time, click-through rate, and retention
• Help generate more advertising revenue
• Increase upselling and crossselling
Why recommendation system is useful for organization?
movie recommender system using vectorization and SVD tech
Approach 1 : Content Based
Approach 2 : Collaborative Filtering (CF) Based
Related Work
Movie Recommendation Systems Using Actor-Based
Matrix Computations in South Korea
• The paper focuses on a content-based movie recommendation system in South Korea, utilizing data
from the Korean Film Council, Naver movie site, and Korea Box Office Information System.
• Data preprocessing steps include removing duplicates, excluding certain genres, and incorporating
additional movie-related information, with a final dataset combining actor, director, genre, plot, and
viewer-related details.
• Correlation Analysis and User Evaluation:
• The study involves computing rank correlation between movies and genres, as well as the correlation
between actors and genres using Pearson's correlation coefficient.
• Results reveal notable differences in correlation patterns between movies and genres compared to
actors and movie genres. Additionally, a user survey is conducted to assess the recommendation
system, with participants expressing preferences based on movie genres or actors.
paper1
• The second paper discusses Collaborative Filtering (CF), specifically User-
CF and Item-CF, as mainstream recommendation algorithms, focusing on
personalized movie recommendations by identifying users with similar
interests.
• Singular Value Decomposition (SVD) is highlighted as a superior algorithm
in the Movie Recommendation System based on the MovieLens dataset,
outperforming Item-CF and User-CF, especially in scenarios with sparse
data.
Movie Recommendation System Based on
SVD Collaborative Filtering
Paper2
movies_metadata.csv
• The main Movies Metadata file. Contains information on 45,000 movies featured in
the Full MovieLens dataset. Features include posters, backdrops, budget, revenue,
release dates, languages, production countries and companies.
keywords.csv
• Contains the movie plot keywords for our MovieLens movies. Available in the form of
a stringified JSON Object.
credits.csv • Consists of Cast and Crew Information for all our movies. Available in the form of a
stringified JSON Object.
links.csv • The file that contains the TMDB and IMDB IDs of all the movies featured.
links_small.csv • Contains the TMDB and IMDB IDs of a small subset of 9,000 movies.
ratings_small.csv • The subset of 100,000 ratings from 700 users on 9,000 movies.
The Movies Dataset https://grouplens.org/datasets/movielens/latest/li
WorkFlow OF content
based and improved
Recommendation
system
Stemming
Stemming is a natural language processing technique that is used to reduce words to
their base form, also known as the root form. The process of stemming is used to
normalize text and make it easier to process.
For each document, a vector is created where each element represents the count of the corresponding term
in the document. The result is a matrix where rows correspond to documents, and columns correspond to
terms in the vocabulary.
Here's a simple example:
Suppose you have 4 documents:
Document 1:"The greatest thing of life is love."
Document 2: "Love is great, it's great to be loved."
Document 3: "Is love the greatest thing?"
Document 4:"I love lasagna for 1000 times."
Count Vectorization
Cosine similarity is a metric used to measure how similar two vectors are, especially in the context
of text data and document similarity. It calculates the cosine of the angle between two vectors,
providing a measure of similarity irrespective of their magnitude
Cosine similarity
Input: recommend(movie) takes a movie title (movie) as input for which the
user wants recommendations.
Output: The function prints the titles of 10 other movies that are most similar
to the input movie.
output
Output of content based model
Improved Content Based
It Uses Vote Count and vote average calculates weighted rating which can be
calculated as:
where,
v is the number of votes for the movie
m is the minimum votes required to be listed in the chart
R is the average rating of the movie
C is the mean vote across the whole report
𝑊𝑒𝑖𝑔ℎ𝑡𝑒𝑑𝑅𝑎𝑡𝑖𝑛𝑔(𝑊𝑅) = (𝑣.R/(𝑣+𝑚))+(𝑚.C/(𝑣+𝑚))
Collaborative Filtering Based
Our content based engine suffers from some
severe limitations.
• It is only capable of suggesting movies which are close to a
certain movie. That is, it is not capable of capturing tastes
and providing recommendations across genres.
• Also, the engine that we built is not really personal in that it
doesn't capture the personal tastes and biases of a user.
Anyone querying our engine for recommendations based on a
movie will receive the same recommendations for that movie,
regardless of who (s)he is.
• Therefore, in this section, we will use Collaborative Filtering to
make recommendations to Movie Watchers. Collaborative
Filtering is based on the idea that users similar to a me can be
used to predict how much I will like a particular product or
service those users have used/experienced but I have not
SVD (Singular Value Decomposition) is a popular matrix
factorization technique commonly used in collaborative
filtering-based movie recommendation systems. In a
movie recommendation system, SVD is applied to
decompose the user-item interaction matrix into three
matrices:
• User Matrix (U): Represents users and their latent
factors.
• Item Matrix (V): Represents items (movies) and their
latent factors.
• Singular Value Matrix (Σ): Represents the singular
values.
SVD Model
The user-item interaction matrix M is
decomposed as M is nearly equal to U.Σ.VT
Hybrid Movie Recommendation System
Build a simple hybrid recommender that brings together techniques we have
implemented in the content based and collaborative filter based engines.
Summary
Content Based Recommender
We built two content based engines;
one that took movie overview and
taglines as input and the other which
took metadata such as cast, crew,
genre and keywords to come up with
predictions. We also devised a simple
filter to give greater preference to
movies with more votes and higher
ratings.
Improved Content Based
Recommender
This system used overall TMDB Vote
Count and Vote Averages to build
Top Movies Charts, in general and for
a specific genre. The IMDB Weighted
Rating System was used to calculate
ratings on which the sorting was
finally performed.
In this Project, we have built 4 different recommendation engines based on
different ideas and algorithms. They are as follows:
Summary
Collaborative Filtering
We used the powerful
Surprise Library to build a
collaborative filter based
on single value
decomposition. The RMSE
obtained was less than 1
and the engine gave
estimated ratings for a
given user and movie.
Hybrid Engine.
We brought together ideas
from content and
collaborative filtering to
build an engine that gave
movie suggestions to a
particular user based on
the estimated ratings that
it had internally calculated
for that user.
Conclusion
We see that for our hybrid recommender, we get different
recommendations for different users although the movie is the same.
Hence, our recommendations are more personalised and tailored towards
particular users.
Thank You for listening!

More Related Content

PPTX
MOVIE RECOMMENDATION SYSTEM.pptx
PPTX
Movie Recommender System Using Artificial Intelligence
PPTX
Movie Recommendation System - MovieLens Dataset
PPTX
Movie Recommendation System using ml.pptx
PDF
final report.pdf
PPT
A Hybrid Recommendation system
PPTX
Movie lens movie recommendation system
PPTX
Internet Movie Database (IMDB) Presentation
MOVIE RECOMMENDATION SYSTEM.pptx
Movie Recommender System Using Artificial Intelligence
Movie Recommendation System - MovieLens Dataset
Movie Recommendation System using ml.pptx
final report.pdf
A Hybrid Recommendation system
Movie lens movie recommendation system
Internet Movie Database (IMDB) Presentation

Similar to movie recommender system using vectorization and SVD tech (20)

PDF
Movie_Recommendation.pdf
PDF
IRJET- Hybrid Recommendation System for Movies
PDF
MOVIE RECOMMENDATION SYSTEM USING COLLABORATIVE FILTERING
PDF
ENTERTAINMENT CONTENT RECOMMENDATION SYSTEM USING MACHINE LEARNING
PPTX
Movie recommendation Engine using Artificial Intelligence
PDF
IRJET- Searching an Optimal Algorithm for Movie Recommendation System
PDF
PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...
PDF
Recommendation System using Machine Learning Techniques
PPTX
Movie recommendation system using collaborative filtering system
PDF
20320140501009 2
PPTX
MovieRecommenderSystem.pptx
PPTX
Recommender systems: Content-based and collaborative filtering
PDF
A Review Study OF Movie Recommendation Using Machine Learning
PPTX
movierecommendationproject-171223181147.pptx
PPTX
Presentation uw wjsjjsjs kennw kekke kkejnen
PDF
An Efficient Content, Collaborative – Based and Hybrid Approach for Movie Rec...
PDF
Movie recommendation project
PDF
Ai used movie recommendations githubbbbb
PDF
Recommendation Systems
PPTX
Design Recommender systems from scratch
Movie_Recommendation.pdf
IRJET- Hybrid Recommendation System for Movies
MOVIE RECOMMENDATION SYSTEM USING COLLABORATIVE FILTERING
ENTERTAINMENT CONTENT RECOMMENDATION SYSTEM USING MACHINE LEARNING
Movie recommendation Engine using Artificial Intelligence
IRJET- Searching an Optimal Algorithm for Movie Recommendation System
PyCon Balkans 2018 // Recommender systems - collaborative filtering and dimen...
Recommendation System using Machine Learning Techniques
Movie recommendation system using collaborative filtering system
20320140501009 2
MovieRecommenderSystem.pptx
Recommender systems: Content-based and collaborative filtering
A Review Study OF Movie Recommendation Using Machine Learning
movierecommendationproject-171223181147.pptx
Presentation uw wjsjjsjs kennw kekke kkejnen
An Efficient Content, Collaborative – Based and Hybrid Approach for Movie Rec...
Movie recommendation project
Ai used movie recommendations githubbbbb
Recommendation Systems
Design Recommender systems from scratch
Ad

Recently uploaded (20)

PPTX
Cell Types and Its function , kingdom of life
PDF
A systematic review of self-coping strategies used by university students to ...
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
Computing-Curriculum for Schools in Ghana
PDF
01-Introduction-to-Information-Management.pdf
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PPTX
Pharma ospi slides which help in ospi learning
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PDF
VCE English Exam - Section C Student Revision Booklet
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PPTX
Orientation - ARALprogram of Deped to the Parents.pptx
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PPTX
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
Cell Types and Its function , kingdom of life
A systematic review of self-coping strategies used by university students to ...
Anesthesia in Laparoscopic Surgery in India
Computing-Curriculum for Schools in Ghana
01-Introduction-to-Information-Management.pdf
Microbial disease of the cardiovascular and lymphatic systems
human mycosis Human fungal infections are called human mycosis..pptx
Pharma ospi slides which help in ospi learning
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
VCE English Exam - Section C Student Revision Booklet
Microbial diseases, their pathogenesis and prophylaxis
Orientation - ARALprogram of Deped to the Parents.pptx
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
Pharmacology of Heart Failure /Pharmacotherapy of CHF
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
Final Presentation General Medicine 03-08-2024.pptx
STATICS OF THE RIGID BODIES Hibbelers.pdf
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
Ad

movie recommender system using vectorization and SVD tech

  • 1. DEPARTMENT OF COMPUTER SCIENCE, UNIVERSITY OF DELHI Movie Recommendation System Dr. Bharti Rana Submitted To: Uddeshy Kr. Bhagat(56) Pooja Sahu(33) MCA Semester 5 Submitted By:
  • 2. About the Project • Introduction • Statement of the Problem • Objective • Related Work • Methodology • Summary and Conclusion
  • 3. • A recommendation system is any system that automatically suggests content, products or serivces which should interest customers based on their preferences What is recommedation system? • Help to increase the site’s page views, dwell time, click-through rate, and retention • Help generate more advertising revenue • Increase upselling and crossselling Why recommendation system is useful for organization?
  • 5. Approach 1 : Content Based
  • 6. Approach 2 : Collaborative Filtering (CF) Based
  • 7. Related Work Movie Recommendation Systems Using Actor-Based Matrix Computations in South Korea • The paper focuses on a content-based movie recommendation system in South Korea, utilizing data from the Korean Film Council, Naver movie site, and Korea Box Office Information System. • Data preprocessing steps include removing duplicates, excluding certain genres, and incorporating additional movie-related information, with a final dataset combining actor, director, genre, plot, and viewer-related details. • Correlation Analysis and User Evaluation: • The study involves computing rank correlation between movies and genres, as well as the correlation between actors and genres using Pearson's correlation coefficient. • Results reveal notable differences in correlation patterns between movies and genres compared to actors and movie genres. Additionally, a user survey is conducted to assess the recommendation system, with participants expressing preferences based on movie genres or actors. paper1
  • 8. • The second paper discusses Collaborative Filtering (CF), specifically User- CF and Item-CF, as mainstream recommendation algorithms, focusing on personalized movie recommendations by identifying users with similar interests. • Singular Value Decomposition (SVD) is highlighted as a superior algorithm in the Movie Recommendation System based on the MovieLens dataset, outperforming Item-CF and User-CF, especially in scenarios with sparse data. Movie Recommendation System Based on SVD Collaborative Filtering Paper2
  • 9. movies_metadata.csv • The main Movies Metadata file. Contains information on 45,000 movies featured in the Full MovieLens dataset. Features include posters, backdrops, budget, revenue, release dates, languages, production countries and companies. keywords.csv • Contains the movie plot keywords for our MovieLens movies. Available in the form of a stringified JSON Object. credits.csv • Consists of Cast and Crew Information for all our movies. Available in the form of a stringified JSON Object. links.csv • The file that contains the TMDB and IMDB IDs of all the movies featured. links_small.csv • Contains the TMDB and IMDB IDs of a small subset of 9,000 movies. ratings_small.csv • The subset of 100,000 ratings from 700 users on 9,000 movies. The Movies Dataset https://grouplens.org/datasets/movielens/latest/li
  • 10. WorkFlow OF content based and improved Recommendation system
  • 11. Stemming Stemming is a natural language processing technique that is used to reduce words to their base form, also known as the root form. The process of stemming is used to normalize text and make it easier to process.
  • 12. For each document, a vector is created where each element represents the count of the corresponding term in the document. The result is a matrix where rows correspond to documents, and columns correspond to terms in the vocabulary. Here's a simple example: Suppose you have 4 documents: Document 1:"The greatest thing of life is love." Document 2: "Love is great, it's great to be loved." Document 3: "Is love the greatest thing?" Document 4:"I love lasagna for 1000 times." Count Vectorization
  • 13. Cosine similarity is a metric used to measure how similar two vectors are, especially in the context of text data and document similarity. It calculates the cosine of the angle between two vectors, providing a measure of similarity irrespective of their magnitude Cosine similarity
  • 14. Input: recommend(movie) takes a movie title (movie) as input for which the user wants recommendations. Output: The function prints the titles of 10 other movies that are most similar to the input movie. output Output of content based model
  • 15. Improved Content Based It Uses Vote Count and vote average calculates weighted rating which can be calculated as: where, v is the number of votes for the movie m is the minimum votes required to be listed in the chart R is the average rating of the movie C is the mean vote across the whole report 𝑊𝑒𝑖𝑔ℎ𝑡𝑒𝑑𝑅𝑎𝑡𝑖𝑛𝑔(𝑊𝑅) = (𝑣.R/(𝑣+𝑚))+(𝑚.C/(𝑣+𝑚))
  • 16. Collaborative Filtering Based Our content based engine suffers from some severe limitations. • It is only capable of suggesting movies which are close to a certain movie. That is, it is not capable of capturing tastes and providing recommendations across genres. • Also, the engine that we built is not really personal in that it doesn't capture the personal tastes and biases of a user. Anyone querying our engine for recommendations based on a movie will receive the same recommendations for that movie, regardless of who (s)he is. • Therefore, in this section, we will use Collaborative Filtering to make recommendations to Movie Watchers. Collaborative Filtering is based on the idea that users similar to a me can be used to predict how much I will like a particular product or service those users have used/experienced but I have not
  • 17. SVD (Singular Value Decomposition) is a popular matrix factorization technique commonly used in collaborative filtering-based movie recommendation systems. In a movie recommendation system, SVD is applied to decompose the user-item interaction matrix into three matrices: • User Matrix (U): Represents users and their latent factors. • Item Matrix (V): Represents items (movies) and their latent factors. • Singular Value Matrix (Σ): Represents the singular values. SVD Model The user-item interaction matrix M is decomposed as M is nearly equal to U.Σ.VT
  • 18. Hybrid Movie Recommendation System Build a simple hybrid recommender that brings together techniques we have implemented in the content based and collaborative filter based engines.
  • 19. Summary Content Based Recommender We built two content based engines; one that took movie overview and taglines as input and the other which took metadata such as cast, crew, genre and keywords to come up with predictions. We also devised a simple filter to give greater preference to movies with more votes and higher ratings. Improved Content Based Recommender This system used overall TMDB Vote Count and Vote Averages to build Top Movies Charts, in general and for a specific genre. The IMDB Weighted Rating System was used to calculate ratings on which the sorting was finally performed. In this Project, we have built 4 different recommendation engines based on different ideas and algorithms. They are as follows:
  • 20. Summary Collaborative Filtering We used the powerful Surprise Library to build a collaborative filter based on single value decomposition. The RMSE obtained was less than 1 and the engine gave estimated ratings for a given user and movie. Hybrid Engine. We brought together ideas from content and collaborative filtering to build an engine that gave movie suggestions to a particular user based on the estimated ratings that it had internally calculated for that user.
  • 21. Conclusion We see that for our hybrid recommender, we get different recommendations for different users although the movie is the same. Hence, our recommendations are more personalised and tailored towards particular users.
  • 22. Thank You for listening!