Skip to content

Tessel11/Tessel_Haagen_Trends_In_Art_History

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Trends in Art History: Text Mining Thesis

This repository contains the code, data, and analysis for the Master's thesis project "Trends in Art History" by Tessel Haagen. The project applies text mining and topic modeling techniques to historical art texts to uncover trends and patterns.

Project Structure

  • src/
    Source code for data processing, topic modeling, and utility functions.

  • notebooks/
    Jupyter notebooks for exploratory data analysis, model development, and visualization.

  • data/
    Contains raw and processed data (large files are gitignored).

  • figures/
    Output figures and plots generated from the analysis.

  • survey answers/
    Contains survey data, topic analysis CSVs, and related scripts/notebooks.

  • pyproject.toml, poetry.lock
    Project dependencies and environment management using Poetry.

Getting Started

  1. Clone the repository:

    git clone <repo-url>
    cd Tessel_Haagen_Trends_In_Art_History
  2. Install dependencies: Make sure you have Poetry installed.

    poetry install
  3. Run Jupyter notebooks:

    poetry run jupyter notebook

    Open and run notebooks in the notebooks/ or survey answers/ directories.

Main Features

  • Topic modeling with BERTopic and KeyBERT.
  • Sentiment analysis and survey evaluation.
  • Visualization of topic and sentiment trends over time.
  • Scripts for preprocessing and analyzing historical art texts.

Data

Some data directories (e.g., data/ecco/, data/eebo/) are excluded from version control due to size. You can get the data from https://textcreationpartnership.org/faq/#faq05

Author

Tessel Haagen
Master's Thesis, Text Mining

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors