I am a researcher, I've got a large social media dataset and want to answer my research question but I don't know how to structure the data