Date on Master's Thesis/Doctoral Dissertation
5-2021
Document Type
Master's Thesis
Degree Name
M.S.
Department
Computer Engineering and Computer Science
Degree Program
Computer Science, MS
Committee Chair
Almaghraby, Adel
Committee Co-Chair (if applicable)
Gentili, Monica
Committee Member
Gentili, Monica
Committee Member
Imam, Ibrahim
Author's Keywords
Word2vec; SVM; text mining
Abstract
Natural Language Processing represents a quantum leap for governance and industries. It enables them to have an insight into hidden patterns and information within their data. In this thesis, we have worked on an important field in Natural Language Processing, which is Text Classification. Our goal is to help restaurant owners to find which dishes customers like more. To do that we have used a dataset from Yelp.com that has 150,000 restaurant reviews, then count the most frequent dishes mentioned. However, this way is not effective except if these reviews are categorized into different restaurants-styles. For this reason, we have used Word2vec with Support Vector Machine algorithms to classify these reviews into four restaurant-style categories (Mediterranean, Indian, Mexican, and Japanese). The experimental result shows that this methodology has successfully achieved a classification accuracy of 87.2%, which shows that the methodology is effective in classifying reviews datasets.
Recommended Citation
Almohaimeed, Saleh Abdullah, "Restaurant style prediction using Word2vec and support vector machine." (2021). Electronic Theses and Dissertations. Paper 3667.
https://doi.org/10.18297/etd/3667