GitHub - leighanne77/sentiment-analysis-simple-demo-model: With ML, in theory, we can deliver sentiment analysis at scale. To demonstrate “how” ML can generate sentiment analysis at scale, I will use one simple model, and analyze one historical data set made up of one mode of data (digital text) to conduct sentiment analysis in just two ways: positive or negative.

Sentiment Analysis: A Simple Model for Demonstration Guide

What is different now is that with ML and generative AI, sentiment analysis can be conducted at scale on any topic at any time, at least in theory. Data scientists with access to sufficient amounts of streaming (live) or historical (regularly updated) data, of any mode (video, text, audio) can now do in a day what used to take professional pollsters weeks and months to assemble.

This simple model is to demonstrate how ML and generative AI are used to conduct sentiment analysis. This model analyzes one data set made up of historical data of reviews on movies and TV shows in a single mode (digital text) and return the sentiment analysis in two ways: positive or negative.

Tool and Dependencies Setup: Imported dependencies including the transformers library, huggingface_hub, and the metrics from the datasets library Selected the base model: distilBERT, a variation of BERT - distilBERT is BERT's streamlined version, crafted by HuggingFace through knowledge distillation. It mirrors BERT's capabilities but is leaner in terms of parameters
Dataset Set Up: The IMDB movie review dataset
Tokenization: Tokenize the dataset via the DistilBERT tokenizer (after dividing the dataset into training and test sets)
Training the Model: The focus is on classification tasks for sentiment analysis, tapping into DistilBERT's expertise Used the AutoModelForSequenceClassification from the transformers suite This was tailored for sequence classification Initialized the model AutoModelForSequenceClassification.from_pretrained("distilbert-base-uncased", num_labels=2) Libraries and API The Trainer API was used The TrainingArguments library allows customizing training parameters for improved efficiency learning rates batch dimensions epochs Using the trainer's .train() method, kickstart the training with cycles of forward and backward passes and optimization
Evaluating the Model: Used the compute_metrics function to analyze evaluation predictions to compute and assess predictions against actual labels Used the trainer's .evaluate() method to calculate the evaluation data: Retrieve accuracy score - load_metric F1 score - load_metric("f1") Extracted the logits and labels from eval_pred calculate the predictions by selecting the index with the maximum value along the last axis (using np.argmax(logits, axis=-1)).

Resources Used

BERT Documentation (Hugging Face): https://huggingface.co/docs/transformers/model_doc/bert BERT Official Guide (Hugging Face): https://huggingface.co/docs/transformers/tasks/sequence_classification DistilBERT Version, Documentation: https://arxiv.org/abs/1910.01108v4

References

Deeply Moving: Deep Learning for Sentiment Analysis. “Deeply Moving: Deep Learning for Sentiment Analysis.” Accessed October 13, 2023. http://nlp.stanford.edu/sentiment/index.html.

“Distilbert-Base-Uncased-Finetuned-Sst-2-English · Hugging Face,” June 1, 2023. https://huggingface.co/distilbert-base-uncased-finetuned-sst-2-english.

Varshney, Neeraj. “Domain Adaptation for Sentiment Analysis.” Analytics Vidhya (blog), June 2, 2020. https://medium.com/analytics-vidhya/domain-adaptation-for-sentiment-analysis-d1930e6548f4.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
README.md		README.md
Simple_Sentiment_Analysis_Model_for_Demonstration_.ipynb		Simple_Sentiment_Analysis_Model_for_Demonstration_.ipynb
With_correct_padding_for_tokenizer_Sentiment_Analysis_Text_Only_Redo_of_Protytpe_Oct_13,_2023_v_0_2.ipynb		With_correct_padding_for_tokenizer_Sentiment_Analysis_Text_Only_Redo_of_Protytpe_Oct_13,_2023_v_0_2.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages