Skip to content

BERTCGAForcaster Class #274

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 35 commits into
base: master
Choose a base branch
from
Open

BERTCGAForcaster Class #274

wants to merge 35 commits into from

Conversation

sonqt
Copy link

@sonqt sonqt commented Feb 28, 2025

Description

This PR introduces several enhancements and new features, including:

  • Forecaster Class: Implements support for auto-encoding transformer models such as BERT, RoBERTa, SpanBERT, DeBERTa, and ELECTRA.
  • New Metric for forecaster.summarize(): Adds an additional evaluation metric to improve summarization capabilities.
  • BERTCGAModel Class: Implements a new model class specifically designed for forecasting with BERT-based architectures.
  • Jupyter Notebooks:
    • Demonstration notebooks showcasing how to use BERTCGAModel.
    • Notebooks for reproducing the performance results reported in the paper.
  • Enhanced Download Configuration: Modifies download_config to enable downloading fine-tuned BERT-base and RoBERTa-large models from the Zissou server.

@jpwchang
Copy link
Collaborator

In addition to the comments above, one additional open point of discussion: so far, we have no documentation pages for subclasses of ForecasterModel (ForecasterModel itself has documentation, located at the bottom of the Forecaster documentation page). This is mainly because that class exists somewhat outside the ConvoKit class hierarchy (it is not a Transformer subclass). In the past, this was alright since we didn't have many concrete ForecasterModel subclasses (just CRAFT and the BoW baseline). But now that we're getting more and more models, you may imagine a user wanting to quickly understand the difference between the many options they have for the forecaster_model parameter in Forecaster, and right now there is nowhere they can go to in the docs to understand this.

It seems to me we definitely need documentation pages now for every ForecasterModel subclass. The discussion to be had is exactly where on the website these pages would go (since they are not part of the Transformer hierarchy, which is the main organizational system for the documentation site right now). Two ideas I can think of are (1): as a new subpage under the Forecaster documentation page, and (2) as a standalone header on the site, similar to how "Utilities" is currently treated (that page also being outside the Transformer hierarchy). But I am also open to other ideas.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants