By tokenization, removing stop words and punctuations, stemming, lemmatization, human language is separated into fragments, words are transformed into numbers. After cleaning the data, several machine learning methods including Logistic Regression, Random Forest, XGBoost were applied to predict if the customers' reviews are negative or positive.