suss singapore university of social sciences aib551 end of course asse
Search for question
Question
The assessment is to assess your ability to design NLP solutions based on the following
case.
Amazon is one of the world's largest e-commerce companies, with a vast product range.
It receives hundreds of thousands of product reviews daily from its users worldwide.
These reviews contain valuable information about the product and customer sentiments
towards it.
However, with the volume of incoming data, it's nearly impossible for Amazon to
manually analyse all the reviews to extract actionable insights. This is where NLP
comes in. Your task is to create an NLP solution to help Amazon structure and analyse
the review data.
Here is an Amazon review dataset collected in the range of May 1996 - Oct 2018,
including reviews (ratings, text, helpfulness votes), product metadata (descriptions,
category information, price, brand, and image features), and links (also viewed/also
bought graphs).
You are encouraged to use the smaller per-category dense subsets, which have been
reduced to extract the k-core, such that each of the remaining users and items have k
reviews each.
Dataset link: https://cseweb.ucsd.edu/~jmcauley/datasets/amazon_v2//nQuestion la
Based on your task, which is to create an NLP solution to help Amazon structure and
analyse the review data, determine a scenario/problem that you are going to analyse
and design NLP solution for the identified scenario/problem. (word limit: 500)
(20 marks)
Question 1b
Design and implement relevant NLP analysis flow to address the identified questions,
which includes but is not limited to:
(i) Dataset preparation; (ii) Vocabulary building; (iii) Sentiment analysis; (iv) Named
Entity Recognition; (v) Topic modeling