Skip to content

Datasets: Overview & Implementation

Jack J Burleson // Lucius Morningstar edited this page Jan 10, 2025 · 1 revision

Question Datasets

Wiki Page: Question Datasets in the Repository

Overview

This page provides a comprehensive overview of the various question datasets within the repository, organized by subject matter and application. Each dataset has been carefully curated to support specific research, educational, and experimental purposes, with metadata, usage instructions, and version histories provided for each.

Repository Structure

The question datasets are located in the /Questions_DB/ directory, organized into subfolders by subject:

Questions_DB/
├── README.md
├── Astronomy/
│   └── ASTRN-v1.md
├── Geography/
│   └── GEO-v1.md
├── Literature/
│   └── LIT-v1.md
├── Mathematics/
│   ├── MATH-v1.md
│   ├── NY-math-questions.md
│   └── PDFs/
│       └── markdown-placeholder.md
└── Psychology/
    └── PSYC-v1.md

Dataset Summaries

Below is a detailed description of each dataset, including its content focus, intended use, and version history.

Astronomy Dataset (ASTRN-v1.md)

  • Focus: Topics in astronomy, including celestial mechanics, planetary science, and cosmology.
  • Primary Use: Testing knowledge of astronomical concepts and problem-solving skills.
  • Examples:
    • "What would happen to Earth's tides if the Moon disappeared?"
    • "How would a satellite's trajectory change if gravity doubled?"
  • Version History:
    • v1.0 (Initial release): Comprehensive question set on celestial phenomena.

Geography Dataset (GEO-v1.md)

  • Focus: Geographic phenomena, spatial relationships, and environmental dynamics.
  • Primary Use: Evaluating understanding of geographic concepts and reasoning.
  • Examples:
    • "Calculate the shortest route between two cities on a globe."
    • "Analyze the impact of rainfall patterns on regional agriculture."
  • Version History:
    • v1.0 (Initial release): Question set covering physical and human geography.

Literature Dataset (LIT-v1.md)

  • Focus: Literary analysis, including themes, motifs, and character studies.
  • Primary Use: Testing comprehension and interpretative skills in literature.
  • Examples:
    • "Identify the metaphors in this passage and explain their significance."
    • "What are the primary themes in this poem?"
  • Version History:
    • v1.0 (Initial release): Core question set for literary interpretation.

Mathematics Dataset (MATH-v1.md & NY-math-questions.md)

  • Focus: Mathematical concepts, problem-solving, and logical reasoning.
  • Primary Use: Assessing mathematical proficiency and analytical thinking.
  • Examples:
    • "Prove that the square root of 2 is irrational."
    • "Design an algorithm to calculate Fibonacci numbers."
  • Additional Resources: Includes PDF placeholders for structured worksheets.
  • Version History:
    • v1.0 (Initial release): Basic and advanced mathematical problems.
    • NY-math-questions.md: Special collection for advanced problem-solving.

Psychology Dataset (PSYC-v1.md)

  • Focus: Psychological theories, behavioral analysis, and cognitive principles.
  • Primary Use: Exploring psychological concepts and their applications.
  • Examples:
    • "What does this sequence of actions reveal about decision-making?"
    • "Explain the core principles of Carl Jung’s archetypes."
  • Version History:
    • v1.0 (Initial release): Foundational questions in psychology.

Metadata and Documentation

Each dataset file includes the following metadata:

  • Title: Dataset name and version.
  • Description: Brief overview of the dataset’s focus and purpose.
  • Version: The current version number has changes tracked in the repository’s CHANGELOG.md file.
  • Author: Contributor(s) to the dataset.
  • Usage Instructions: Guidelines for applying the dataset in research or testing.

Usage Guidelines

  1. Selection: Choose the dataset that aligns with your research or educational objectives.
  2. Implementation:
    • Review the README.md file in each dataset folder for specific usage details.
    • Follow guidelines for incorporating the dataset into experimental or educational workflows.
  3. Analysis:
    • Use provided examples as templates for creating additional questions.
    • Document responses and findings for future reference.

Contributing

Contributions to the question datasets are welcome. To contribute:

  1. Fork the repository.
  2. Add or modify datasets within the /Questions_DB/ directory.
  3. Update metadata and documentation as necessary.
  4. Submit a pull request with a detailed description of changes.

License

All datasets are licensed under the MIT License. See the LICENSE file in the root directory for details.


For additional details or assistance, consult the repository’s main README.md or contact the project maintainers.

Project Resources

Key Wiki Components

Key Repository Components

Featured Profiles

  • Alan Turing (Historical/Computational)
  • Professor Athena (Academic/Analytical)
  • Professor Milgrim (Authority-based)
  • Saint Enigma (Mysterious/Cryptic)
  • Scarlet Quinn (Strategic/Persuasive)
  • Control Profiles (Baseline)
Clone this wiki locally