JournalistGPT is designed to revolutionize the way news events in India are analyzed by leveraging advanced large language models (LLMs). It can efficiently query, summarize, and interconnect multiple news events, allowing users to track developments across different timeframes and sources. By identifying patterns, common themes, and underlying narratives, the system helps researchers gain a deeper understanding of the socio-political and economic factors influencing events. Furthermore, it provides contextual analysis by linking past incidents with present occurrences, offering a comprehensive timeline of interconnected events. This functionality is particularly valuable for researchers, policymakers, and analysts who seek to uncover the broader implications of news developments beyond isolated incidents.
Beyond mere summarization and event connection, JournalistGPT extends its capabilities to detect flaws, misinformation, or systemic shortcomings that may have contributed to a particular event. By analyzing multiple reports, statements, and expert opinions, it can highlight inconsistencies, media biases, or governance lapses that led to the situation. Advanced sentiment and bias detection modules help assess the tone and framing of news coverage, ensuring a more objective and holistic understanding of events. Additionally, JournalistGPT can generate structured reports, visualizing event interconnections through graphs and charts, making complex event relationships easier to comprehend. As a tool strictly meant for research purposes, it aims to advance the application of LLMs in media analysis while ensuring ethical and responsible use in studying Indian news narratives.
This repository contains a GPT model trained on a news dataset from Inshorts, spanning from 2005 to December 2023. The model has been fine-tuned to generate news summaries, headlines, and insights based on extensive real-world data.
Access to the trained models is available upon request. You can contact me via LinkedIn.
Inshorts is a news aggregation platform that provides concise summaries of news articles, typically within 60 words. The dataset used in this project was sourced from Kaggle (Inshorts Dataset - English) and includes comprehensive news details over nearly two decades.
I do not hold any rights to the dataset. The dataset was provided on Kaggle as open-source and is used solely for research and educational purposes.
To use this project, follow these steps:

- Download the dataset: Obtain the CSV file from Kaggle (search for *Inshorts Dataset - English*).
- Convert to input format: Open the Jupyter Notebook in the `data/` folder and use it to transform the CSV into `input.txt`.
- Prepare data: Move `input.txt` into the `data/` directory.
- Set up the environment:

  ```shell
  conda create --name journalist python=3.9
  conda activate journalist
  pip install -r requirements.txt
  ```
- Prepare the dataset for training:

  ```shell
  python data/news_char/prepare.py
  ```
- Train the model:

  ```shell
  python config/train_news_base_gpt.py
  ```
- Adjust training parameters (optional): You can modify `config/train_news_base_gpt.py` for better performance:

  ```python
  gradient_accumulation_steps = 1
  batch_size = 64
  block_size = 256  # context of up to 256 previous characters
  n_layer = 6
  n_head = 6
  n_embd = 384
  dropout = 0.2
  ```
- Trained model output: The trained model will be saved to `out_news_char/`.
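For readers curious what the preparation step does under the hood, the sketch below shows a typical character-level pipeline in the style of Karpathy's nanoGPT, on which this project is based. The function name `prepare_char_dataset`, the 90/10 split, and the in-memory return values are illustrative assumptions; the actual `data/news_char/prepare.py` may differ (for instance, it likely writes `train.bin`/`val.bin` to disk instead of returning arrays).

```python
# Hypothetical sketch of a character-level prepare step (nanoGPT-style).
# Names and the train/val split here are assumptions, not the actual
# contents of data/news_char/prepare.py.
import numpy as np

def prepare_char_dataset(text: str, val_fraction: float = 0.1):
    """Map each unique character to an integer id and split train/val."""
    chars = sorted(set(text))                      # character vocabulary
    stoi = {ch: i for i, ch in enumerate(chars)}   # char -> id
    itos = {i: ch for ch, i in stoi.items()}       # id -> char

    data = np.array([stoi[ch] for ch in text], dtype=np.uint16)
    n = int(len(data) * (1 - val_fraction))
    train_ids, val_ids = data[:n], data[n:]
    meta = {"vocab_size": len(chars), "stoi": stoi, "itos": itos}
    return train_ids, val_ids, meta

# Example: encode a tiny corpus and inspect the split
train_ids, val_ids, meta = prepare_char_dataset("news headline: markets rise\n" * 10)
```

In the nanoGPT layout, the id arrays would then be written out with `train_ids.tofile("train.bin")` and the `meta` dict pickled alongside them so the training and sampling scripts can decode model output back to text.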
- The code is automatically optimized to run on Mac MPS (Apple M1/M2/M3 chips) for faster training.
- If you wish to use CUDA (NVIDIA GPUs), update the `device` setting in `config/train_news_base_gpt.py`:

  ```python
  device = "cuda"  # change from "mps" to "cuda" for NVIDIA GPU support
  ```
More models will be added to fine-tune open-source models from Hugging Face for enhanced performance and improved news summarization capabilities. Stay tuned for updates!
This project is inspired by Andrej Karpathy's work on training GPT models on character-level datasets.
For any inquiries or access requests, feel free to reach out on LinkedIn.