# Brand perception and sentiment analysis

Baseline TF-IDF + logistic regression sentiment models for brand-directed tweets, with optional Hugging Face backends (DistilBERT, BERTweet) for experiments. The backends are defined in `models/sentiment/model_factory.py`.

## Layout

| Path | Purpose |
| --- | --- |
| `data/datasets/` | CSV datasets (e.g. Twitter sentiment, processed GoEmotions splits) |
| `data/processed/` | Train/val splits and derived tables from preprocessing |
| `notebooks/` | Exploratory analysis (e.g. GoEmotions EDA) |
| `models/sentiment/` | Training, evaluation, prediction, config, preprocessing |
| `artifacts/` | Trained weights and reports (gitignored; regenerate locally) |

## Setup

python -m venv .venv
source .venv/bin/activate   # Windows: .venv\Scripts\activate
pip install -r requirements.txt

## Train the sklearn baseline

python -m models.sentiment.train
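
For orientation, a TF-IDF + logistic regression baseline of the kind this project describes can be sketched with scikit-learn as shown here. This is a minimal illustration and not the code in `models/sentiment/`: the toy tweets, the label names, and the hyperparameters are all assumptions made for the example.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Toy brand-directed tweets (hypothetical data, for illustration only).
train_texts = [
    "I love my new phone, great battery",
    "Fantastic customer service, thank you",
    "Best sneakers I have ever owned",
    "Terrible support, still waiting for a refund",
    "The update broke everything, very disappointed",
    "Worst delivery experience ever",
]
train_labels = ["positive", "positive", "positive",
                "negative", "negative", "negative"]

# Vectoriser and classifier chained into one estimator, so the same
# preprocessing is applied at fit time and at predict time.
baseline = make_pipeline(
    TfidfVectorizer(lowercase=True, ngram_range=(1, 2)),  # assumed settings
    LogisticRegression(max_iter=1000),
)
baseline.fit(train_texts, train_labels)

new_texts = ["great phone, love it", "very disappointed with the refund"]
predictions = baseline.predict(new_texts)
probabilities = baseline.predict_proba(new_texts)  # columns follow baseline.classes_
```

Keeping the vectoriser inside the pipeline means a single object can be saved and reloaded for prediction, which fits the joblib-based batch prediction described under Predict.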

## Predict

Single-call API (from Python): `from models.sentiment.predict import predict_sentiment`.

Batch CSV (sklearn joblib model):

python -m models.sentiment.predict sklearn-batch --input-path data/datasets/twitter-sentiment/Dataset\ -\ Test.csv --output-path data/processed/twitter_sentiment_test_with_predictions.csv

Batch CSV (saved Hugging Face sequence classifier directory):

python -m models.sentiment.predict hf-batch --model-path /path/to/saved_hf_model --input-path data/datasets/gomotions_processed/goemotions_test.csv --output-path predictions.csv

## Sample entry point

python main.py

# Brand Perception & Sentiment Analysis

News article pipeline for brand extraction, topic modelling, sentiment analysis, and aspect-based sentiment extraction.

## Structure

- `models/`
  - `absa/`: PyABSA EMCGCN triplet extraction wrapper
  - `NER/`: brand entity extraction (spaCy + rules)
  - `Topic-Modeling/`: LDA topic modelling

## ABSA

The pipeline supports multi-aspect extraction with PyABSA using the Hugging Face checkpoint `deepakm10/brand-absa-emcgcn`.

Install dependencies:

```bash
pip install pyabsa
```

Control ABSA with environment variables:

export BRAND_PERCEPTION_ENABLE_ABSA=1
export BRAND_PERCEPTION_ABSA_MODEL=deepakm10/brand-absa-emcgcn

If ABSA is disabled or the model fails, the pipeline falls back to the existing stub sentiment behavior and emits a default aspect of `general`.
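
The toggle-and-fallback behavior can be sketched roughly as follows. The two environment variable names come from this README; everything else is an assumption for illustration: the function names, treating only `1` as "enabled", defaulting to the documented checkpoint when the model variable is unset, and the shape of the returned records.

```python
import os

DEFAULT_ABSA_MODEL = "deepakm10/brand-absa-emcgcn"


def absa_enabled(env=os.environ):
    """Assumption: ABSA is on only when the flag is exactly '1'."""
    return env.get("BRAND_PERCEPTION_ENABLE_ABSA", "0").strip() == "1"


def analyse(text, extract_triplets, stub_sentiment, env=os.environ):
    """Run ABSA when enabled; otherwise, or on failure, use the stub.

    `extract_triplets(text, model_name)` and `stub_sentiment(text)` are
    hypothetical callables standing in for the real pipeline components.
    """
    def fallback():
        # Stub sentiment attached to the default aspect "general".
        return [{"aspect": "general", "sentiment": stub_sentiment(text)}]

    if not absa_enabled(env):
        return fallback()

    model_name = env.get("BRAND_PERCEPTION_ABSA_MODEL", DEFAULT_ABSA_MODEL)
    try:
        return extract_triplets(text, model_name)
    except Exception:
        # Any model failure degrades to the stub instead of aborting the run.
        return fallback()
```

Passing the environment mapping as a parameter keeps the sketch easy to test without touching the real process environment.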

Run the local demo:

python demo_integration.py

Run the API:

uvicorn api.app:app --reload

Run the Streamlit dashboard:

streamlit run dashboard.py

If the API is not running, the dashboard will fall back to built-in sample data so the layout remains usable for demos and presentation prep.
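
A fallback of this kind might look like the sketch below. It is not taken from `dashboard.py`: the `SAMPLE_SUMMARY` placeholder, the function names, and the choice of the `/analytics/summary` endpoint are assumptions used to show the pattern.

```python
import json
from urllib.error import URLError
from urllib.request import urlopen

API_BASE = "http://127.0.0.1:8000"

# Hypothetical built-in sample payload; the real sample data is not shown here.
SAMPLE_SUMMARY = {"source": "sample"}


def fetch_json(url, timeout=2.0):
    """GET a URL and decode the JSON body."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


def load_summary(fetch=fetch_json):
    """Return live data when the API answers, sample data otherwise."""
    try:
        return fetch(f"{API_BASE}/analytics/summary")
    except (URLError, OSError, ValueError):
        # Connection refused, timeout, or a non-JSON body: keep the UI usable.
        return SAMPLE_SUMMARY
```

The `fetch` parameter is only there so the fallback path can be exercised without a running server.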

Test API endpoints:

curl http://127.0.0.1:8000/health
curl http://127.0.0.1:8000/analytics/summary
curl http://127.0.0.1:8000/analytics/topics
curl http://127.0.0.1:8000/analytics/aspects
curl "http://127.0.0.1:8000/analytics/timeseries?rolling_window_days=7"
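
The same checks can be scripted from Python with the standard library. This is a sketch: the endpoint paths are the ones listed above, while the helper names and the choice to report only HTTP status codes (the response schemas are not documented here) are assumptions.

```python
from urllib.error import HTTPError
from urllib.parse import urlencode
from urllib.request import urlopen

BASE_URL = "http://127.0.0.1:8000"

# (path, query parameters) for each endpoint listed in this README.
ENDPOINTS = [
    ("/health", {}),
    ("/analytics/summary", {}),
    ("/analytics/topics", {}),
    ("/analytics/aspects", {}),
    ("/analytics/timeseries", {"rolling_window_days": 7}),
]


def build_url(path, params=None, base_url=BASE_URL):
    """Join base URL, path, and an optional query string."""
    query = urlencode(params or {})
    return f"{base_url}{path}?{query}" if query else f"{base_url}{path}"


def check_endpoints(base_url=BASE_URL, timeout=5.0):
    """Return a dict mapping each URL to its HTTP status or an error string."""
    results = {}
    for path, params in ENDPOINTS:
        url = build_url(path, params, base_url)
        try:
            with urlopen(url, timeout=timeout) as response:
                results[url] = response.status
        except HTTPError as exc:
            results[url] = exc.code  # the server answered with an error status
        except OSError as exc:
            results[url] = f"unreachable: {exc}"  # refused, timed out, etc.
    return results
```

Calling `check_endpoints()` while `uvicorn` is running gives a quick smoke test of all five routes.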

## NER

The NER step runs after `news_dailyworker.Preprocessing().runner()` and adds `ner_brands` and `ner_raw_json` columns to the daily article CSV.

from models.NER.ner_pipeline import NERPipeline
import pandas as pd

df = pd.read_csv("data/dailyworker/2025-01-15.csv")
df = NERPipeline().run_on_dataframe(df)
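
To make the two added columns concrete, here is a rules-only sketch that annotates one row at a time. It is not the `NERPipeline` implementation, which combines spaCy with rules. The brand list, the `text` key, the semicolon-joined format of `ner_brands`, and the JSON layout of `ner_raw_json` are all assumptions for illustration.

```python
import json
import re

# Hypothetical brand gazetteer; the real rule set is not shown in this README.
KNOWN_BRANDS = ["Nike", "Adidas", "Apple"]
CANONICAL = {brand.lower(): brand for brand in KNOWN_BRANDS}
BRAND_PATTERN = re.compile(
    r"\b(" + "|".join(re.escape(brand) for brand in KNOWN_BRANDS) + r")\b",
    re.IGNORECASE,
)


def annotate_row(row, text_key="text"):
    """Return a copy of `row` with ner_brands and ner_raw_json added."""
    matches = [
        {
            "brand": CANONICAL[m.group(1).lower()],
            "start": m.start(),
            "end": m.end(),
        }
        for m in BRAND_PATTERN.finditer(row.get(text_key, ""))
    ]
    unique_brands = sorted({match["brand"] for match in matches})
    return {
        **row,
        "ner_brands": ";".join(unique_brands),  # assumed flat string format
        "ner_raw_json": json.dumps(matches),    # every mention, with offsets
    }


annotated = annotate_row({"text": "Nike and adidas announce a deal with Nike stores"})
```

Keeping the raw matches as JSON next to the deduplicated brand list lets downstream steps choose between a simple filter column and full mention-level detail.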

Evaluate:

python -m models.NER.evaluate_ner
python -m models.NER.evaluate_ner --csv path/to/rating.csv
