A Public Dataset on the 2024 U.S. Presidential Election on Twitter/X
We just released the first version of a large-scale dataset capturing political discourse on Twitter/X related to the 2024 U.S. Presidential Election. The dataset consists of over 22 million posts collected between May 1, 2024 and July 31, 2024, using a custom-built scraper targeting election-specific hashtags, political figures, and major events.
This work is co-authored by Ashiwin Balasubramanian, Vito Zou, Hitesh Narayana, Christina You, and Emilio Ferrara.
Abstract
In this paper, we introduce a dataset comprising 22 million publicly available posts on X.com (formerly Twitter),
collected from May to July 2024. Using a targeted scraping strategy focused on keywords tied to key political figures,
events, and narratives, we aligned our data collection with the U.S. election cycle to study real-time discourse,
sentiment, and misinformation.
We also present a preliminary analysis of dominant hashtags and topics, laying the groundwork for future research on
online political influence.
Links
- SSRN Paper: View on SSRN
- Dataset on GitHub: sinking8/usc-x-24-us-election
- DOI: 10.2139/ssrn.5018883
Citation
@misc{balasubramanian2024public,
title={A Public Dataset Tracking Social Media Discourse about the 2024 U.S. Presidential Election on Twitter/X},
author={Balasubramanian, Ashiwin and Zou, Vito and Narayana, Hitesh and You, Christina and Ferrara, Emilio},
year={2024},
note={SSRN Working Paper},
url={https://ssrn.com/abstract=5018883}
}