Add TF-IDF + LinearSVC TweetEval sentiment example tuned with Optuna (with smoke test & README) #333

aish-warya-iyer · 2025-10-11T16:39:28Z

This example adds a short, self-contained demonstration of using Optuna to tune a TF-IDF + LinearSVC pipeline for sentiment classification on the TweetEval sentiment dataset (three labels: negative, neutral, positive).

What it does

Loads the TweetEval sentiment dataset using the datasets library.

Tunes both the TF-IDF vectorizer (feature count, n-gram range, etc.) and LinearSVC parameters (C, loss, class_weight) using Optuna.

Uses macro-F1 on the validation split as the objective (1 – macro-F1 minimized).

Retrains the best configuration on train + validation and prints a test report.

Files added

examples/sklearn/svm_tfidf_tweeteval_sentiment.py – main script

examples/sklearn/svm_tfidf_tweeteval_sentiment.md – short usage notes

tests/test_svm_tfidf_tweeteval.py – quick smoke test for CI

How to run
python examples/sklearn/svm_tfidf_tweeteval_sentiment.py --n-trials 20 --max-train 20000
pytest -q # optional quick test

Notes

Keeps runtime light by allowing the --max-train argument to limit samples.

Demonstrates how Optuna can help search SVM + text-feature spaces efficiently.

No external dependencies beyond datasets, scikit-learn, and optuna.

… add smoke test and README snippet

github-actions · 2025-10-19T23:04:34Z

This pull request has not seen any recent activity.

github-actions · 2025-11-03T23:04:48Z

This pull request was closed automatically because it had not seen any recent activity. If you want to discuss it, you can reopen it freely.

c-bata · 2025-11-04T01:30:52Z

@aish-warya-iyer Please feel free to reopen this after fixing all CI checks 🙏

aish-warya-iyer added 3 commits October 11, 2025 09:37

Add README snippet for SVM TF-IDF TweetEval example

2294b70

Add TF-IDF + LinearSVC TweetEval sentiment example tuned with Optuna;…

2b9f0f9

… add smoke test and README snippet

CI: make smoke test network-free by testing --help output only

b33d3c4

github-actions bot added the stale Exempt from stale bot labeling. label Oct 19, 2025

github-actions bot closed this Nov 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add TF-IDF + LinearSVC TweetEval sentiment example tuned with Optuna (with smoke test & README) #333

Add TF-IDF + LinearSVC TweetEval sentiment example tuned with Optuna (with smoke test & README) #333

Uh oh!

aish-warya-iyer commented Oct 11, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Oct 19, 2025

Uh oh!

github-actions bot commented Nov 3, 2025

Uh oh!

c-bata commented Nov 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add TF-IDF + LinearSVC TweetEval sentiment example tuned with Optuna (with smoke test & README) #333

Add TF-IDF + LinearSVC TweetEval sentiment example tuned with Optuna (with smoke test & README) #333

Uh oh!

Conversation

aish-warya-iyer commented Oct 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Oct 19, 2025

Uh oh!

github-actions bot commented Nov 3, 2025

Uh oh!

c-bata commented Nov 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

aish-warya-iyer commented Oct 11, 2025 •

edited

Loading