Skip to content

[ChatLSE] Explore Network X for course related content #113

@KristinaD1910

Description

@KristinaD1910

Our dataset now contains much of the course-related content from the lse calendar page. However, if the database does not include some information on a particular course, it will answer with the information from another.

Potential Solution

Collect all the course related data from the dataset and conduct network analysis over it (using NetworkX as recommended by Jon).

Hypothetical Pipeline (with vector search).

  • Use function calling to categorise the user query as being course-related.
  • Focus the attention in the pipeline towards the various communities and similarities between all the course-related content (within itself and the user query).
  • For example, regarding the BSc Economics programme, the relevant data should include all courses available on this programme (as well as other relevant information) and only this.
  • Point the vector search to only the documents which fit the criteria.

Note A preliminary experiment uses a filtered dataset for course-related content, which can be found on the SharePoint, as it is too large to be uploaded to Github.

Metadata

Metadata

Assignees

Labels

experimenting 🔬We want to explore some data or perform some analysis that will become a feature later

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions