How will you identify fake reviews on Flipkart? [Flipkart PM Ques]
Interviewer: How will you identify fake reviews on Flipkart? What will you do to decrease them?
Image source - https://www.searchenginejournal.com/
Priyanka: To get a better understanding of the problem, I would like to seek some clarification.
Interviewer: Yes, please go ahead.
Priyanka: Firstly I would like to clarify my understanding of Flipkart, it is an e-commerce platform where sellers interact with users and sell their products through the platform. Flipkart gets a commission from each sale made through its platform. It offers other services like travel booking, grocery etc.
Interviewer: Yes.
Priyanka: Now talking about fake reviews. Reviews are users’ opinions for a particular product which help other shoppers get a better understanding of the product. It usually contains text and might contain images also. For example — If a user buys a dress from Flipkart and finds it good they post a picture of the dress and write a review to describe their experience. With e-commerce growing in popularity a lot of users rely on reviews of the product posted on the platform. Reviews are meant to build trust in the product and provide a first-hand experience for users.
Fake reviews are information stated by a user to falsely say a product is good or bad. Fake reviews can lead to a loss of trust in the product and reduce its sales. It can damage a brand’s reputation and create a negative image. It can be used to sabotage the competition.
Interviewer: Yes, that’s well defined.
Priyanka: It is important to mention that fake reviews can be positive, neutral, or negative. For example — A positive review about a bad product can make a buyer lose trust in the Flipkart platform whereas a negative review about a good product can reduce its sales.
Interviewer: That's a good observation.
Priyanka: As we know Flipkart has a lot of categories like electronics, accessories, home décor, personal care, etc. is there any particular category where we are facing this issue?
Interviewer: Yes, we are facing an issue in the electronics category.
Priyanka: Is there any particular metric that we are observing is getting impacted due to this? What I can think of is that if people see good reviews about a product they might purchase it but if it doesn’t meet their expectations, then they will return the product.
Interviewer: Yes, the return rate is high.
Priyanka: To ensure we are on the same page I would like to summarise what we have discussed till now. We talked about what fake reviews are, how they harm the platform and why is it important to do away with fake reviews. We have narrowed down our problem statement by identifying that there is an increase in fake reviews on Flipkart in the electronics category. Now we will discuss what we can do to solve this issue.
Interviewer: That’s well summarised.
Priyanka: Talking about the user journey of a buyer who would post review post-purchase:
User Journey
Ways to handle fake reviews
1. Rule-based Classification using Machine Learning:
We can create a classification engine with if/else rules from heuristics to help us identify fake reviews.
- Fake reviews might post links to other products
- Fake reviews have either excellent or poor grammar
- Fake reviews have extremely negative or positive emotion
- Fake reviews arise from users who have not purchased the product. Currently, Flipkart does not allow users who have not purchased the product to post reviews about the product.
2. Human moderation:
The moderation team will audit the reviews before they are posted on the platform. The reviews should not violate the policy laid down by the platform. Additionally, there should not be any derogatory remarks. Accounts that have previously shown some dubious behaviour will be kept in check.
Human moderation will also come into play when reviews are erroneously removed by ML algorithms due to bias present in the model.
3. Review Analysis using machine learning:
We can build a machine learning model to classify reviews into fake and non-fake by giving a probability likelihood of each review.
Cleaning data: First we have to clean the data by removing any duplicate reviews or outliers. Duplicate reviews can be reviews posted from two different user IDs on the same product or exactly the same review posted by a user on two different products. Outlier reviews can be thought of as those whose rating deviates a lot from the average product rating.
Feature selection: We will take into account some of the important information related to the review like who is the reviewer, what is the content, and what is the product that is being reviewed.
Model building: As we need to classify the reviews into fake and genuine, so we can use logistic regression. We can set a probability score to reflect the likelihood of the review being fake/non-fake. We will use metrics like accuracy or sensitivity vs specificity to track the performance of our model.
To compare our logistic regression model, we can also build a decision tree model and see which one is performing better. If we have imbalanced data i.e. the ratio of fake vs non-fake reviews is not balanced, we should perform upsampling to balance the dataset.
We can use a combination of human moderation and machine learning to obtain the best results and remove bias arising out of machine learning models.
Metrics
After implementing the techniques that we discussed, we can track the following metrics:
- % increase/decrease on the returns made by a user
- % increase/decrease in the periodic sales for the products where fake reviews were identified
- % increase/decrease in average revenue per user
To summarise, we discussed what fake reviews are and how they impact the Flipkart platform. We narrowed down our problem statement by saying that the fake reviews have increased on the Flipkart platform in the electronics category leading to an increase in the return rate. We discussed techniques like rule-based classification, human moderation and review analysis to solve the problem. We identified metrics that will help us track the impact on the platform once the fake reviews have decreased.
Interviewer: That was a great discussion. Thank you.
Comments
Post a Comment