TeleQuAD: Telecom Question Answering Dataset Suite

TeleQuAD is a suite of question-answering datasets and models specifically designed for the telecommunications domain. It provides various QA capabilities including extractive, generative, retrieval-augmented generation (RAG), and tabular structured data question answering.

Repository Structure

TeleQuAD is organized into the following task-specific subdirectories, each containing the respective dataset:

TeleQuAD-Extractive: The extractive QA dataset based on technical documentation from 3GPP documents.
TeleQuAD-Tabular: QA systems for table structured telecom data (specs, configurations, etc.)

Usage

Clone the repository and change to the directory.
Choose your QA task type and change to the relevant subdirectory.
Follow the task-specific README available for each dataset in the respective folder.

Contributing to the Dataset

Contributions to the dataset are welcome, please raise a pull request and we would review the changes.

Usage of TeleQuAD in Literature

[1] Holm, Henrik. "Bidirectional Encoder Representations from Transformers (BERT) for question answering in the telecom domain: Adapting a BERT-like language model to the telecom domain using the electra pre-training approach." (2021).

[2] Gunnarsson, Maria. "Multi-hop neural question answering in the telecom domain.)" LTH, Lund University: Lund, Sweden(2021).

[3] Bissessar, Daniel and Alexander Bois. "Evaluation of methods for question answering data generation: Using large language models." (2022).

[4] Nimara, Doumitrou Daniil, Fitsum Gaim Gebre and Vincent Huang. "Entity Recognition in Telecommunications using Domain-adapted Language Models." 2024 IEEE International Conference on Machine Learning for Communication and Networking (ICMLCN 2024).

[5] Karapantelakis, Athanasios, et al. "Using Large Language Models to understand telecom standards." 2024 IEEE International Conference on Machine Learning for Communication and Networking (ICMLCN 2024).

[6] Roychowdhury, Sujoy, Sumit Soman, HG Ranjani, Avantika Sharma, Neeraj Gunda and Sai Krishna Bala. “Evaluation of Table Representations to Answer Questions from Tables in Documents : A Case Study using 3GPP Specifications”. arXiv preprint arXiv:2408.17008 (2024).

[7] Roychowdhury, Sujoy, Sumit Soman, HG Ranjani, Neeraj Gunda, Vansh Chhabra and Sai Krishna Bala. "Evaluation of RAG Metrics for Question Answering in the Telecom Domain." Workshop on Foundation Models in the Wild, International Conference on Machine Learning (ICML 2024).

[8] Roychowdhury, Sujoy, Sumit Soman, HG Ranjani, Neeraj Gunda, Vansh Chhabra, Subhadip Bandyopadhyay and Sai Krishna Bala. “Investigating Distributions of Telecom Adapted Sentence Embeddings for Document Retrieval”, Workshop on Next-Gen Networks through LLMs, Action Models, and Multi-Agent Systems, International Conference on Communications (ICC 2025).

Citation

If you use TeleQuAD in your research, please cite:

@article{
  telequad2025,
  title={TeleQuAD: A Suite of Question Answering Datasets for the Telecom Domain},
  author={Fitsum Gebre and Henrik Holm and Maria Gunnarsson and Doumitrou Nimara and Jieqiang Wei and Vincent Huang and Avantika Sharma and H G Ranjani},
  booktitle={Ericsson},
  year={2025}
  }

License

This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License.

Acknowledgments

TeleQuAD is developed and maintained by Ericsson AB and published for research purposes.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
extractive/v4		extractive/v4
tabular		tabular
.DS_Store		.DS_Store
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TeleQuAD: Telecom Question Answering Dataset Suite

Repository Structure

Usage

Contributing to the Dataset

Usage of TeleQuAD in Literature

Citation

License

Acknowledgments

About

Releases

Packages

Contributors 2

EricssonResearch/TeleQuAD

Folders and files

Latest commit

History

Repository files navigation

TeleQuAD: Telecom Question Answering Dataset Suite

Repository Structure

Usage

Contributing to the Dataset

Usage of TeleQuAD in Literature

Citation

License

Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Packages