This project classifies the sentiment of Amazon customer reviews.
Two different techniques are applied:
Dataset Ref: https://www.kaggle.com/datasets/snap/amazon-fine-food-reviews
Citation: J. McAuley and J. Leskovec. From amateurs to connoisseurs: modeling the evolution of user expertise through online reviews. WWW, 2013.
# Data handling, plotting, and NLP libraries
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
import nltk

# Use the ggplot look for all matplotlib figures
plt.style.use('ggplot')
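# Load the full review dataset and report its size
df = pd.read_csv('/content/Reviews.csv')
print(df.shape)
# Keep only the first 500 rows so later steps run quickly
df = df.head(500)
print(df.shape)
df.head()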
# Bar chart of how many reviews fall under each star rating (1 to 5)
ax = (
    df['Score'].value_counts().sort_index()
    .plot(kind='bar',
          title='Count of Reviews by Stars',
          figsize=(10, 5))
)
ax.set_xlabel('Review Stars')
plt.show()
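The bar chart shows raw counts only. As a rough companion, the sketch below tabulates the same counts alongside percentage shares, which makes any imbalance between star ratings easier to read. The helper name `score_distribution` and the toy data are hypothetical and not part of the original notebook; it assumes only that the ratings live in a `Score` column, as they do above.

```python
import pandas as pd

def score_distribution(frame, column="Score"):
    """Return per-star counts and percentage shares, ordered by star rating.

    Hypothetical helper for illustration; not part of the original notebook.
    """
    counts = frame[column].value_counts().sort_index()
    percent = (counts / counts.sum() * 100).round(1)
    return pd.DataFrame({"count": counts, "percent": percent})

# Toy data standing in for the 500-row sample (an assumption, not real reviews)
toy = pd.DataFrame({"Score": [5, 5, 4, 1, 5, 3, 5, 2, 4, 5]})
summary = score_distribution(toy)
print(summary)
```

In the notebook itself, calling `score_distribution(df)` should give the same kind of table for the loaded sample.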