QonTwitterIDDP

This repo contains replication files for IDDP's report about QAnon on Twitter.

Data

In compliance with Twitter's "Content redistribution" guidelines in the Developer Policy, we provide a list of the ID of each tweet in our dataset (link to file). Researchers should be able to use this information to rehydrate tweets (i.e., go from a list of tweet IDs to a list of full tweet objects). There are open-source tools that can be used to rehydrate tweets (for example, Hydrator) if researchers prefer not to write their own script to do this.

The link above points to a page with two files: a CSV file and a .tar.gz file. These two files have the same contents.

Code

We provide a number of Python files that contain the code we used to extract data from our database, then process, analyze, and visualize that data.

To ensure that these Python files contain functional code, we also provide QonTwitter.yml, which contains information about the Python environment we used (via Conda) to execute this code. Researchers looking to replicate this analysis can ensure that the code is functional by creating a virtual environment using this yml file.

To ensure compliance with Twitter's guidelines, we cannot provide intermediate datafiles that we created as part of our extract-process-analyze-visualize pipeline. However, researchers can use the code in these files to replicate our analyses using any dataset of hydrated tweets.

language_breakdown_check.py: creates the chart showing the relative proportion of different languages.
daily_count.py: generates descriptive statistics and visualizations about the number of tweets per day.
confirm_mongo_account_info_w_id.py: generates descriptive statistics and visualizations about:
- the number of users in the dataset
- the number of tweets per user
- the number of unique user descriptions per user
tweet_type_count.py: counts the number of retweets, quote tweets, and original tweets.
get_retwets_w_userid.py: generates descriptive statistics and visualizations about:
- the number of times each tweet was retweeted
- the number of times each account was retweeted
- the number of times each account created retweets
- the number of times accounts retweeted tweets originally sent by themselves
user_account_create_date.py: generates descriptive statistics and visualizations about when accounts that appear in our dataset were created.
user_bio_token_count.py: generates visualizations about the top unigrams, bigrams, and @mentions in user bios.
process_tweet_text.py: generates visualizations about the top unigrams and bigrams in the text of tweets.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

QonTwitterIDDP

Data

Code

About

Releases

Packages

Contributors 2

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
LICENSE		LICENSE
QonTwitter.yml		QonTwitter.yml
README.md		README.md
confirm_mongo_account_info_w_id.py		confirm_mongo_account_info_w_id.py
daily_count.py		daily_count.py
get_retweets_w_userid.py		get_retweets_w_userid.py
language_breakdown_check.py		language_breakdown_check.py
process_tweet_text.py		process_tweet_text.py
tweet_type_count.py		tweet_type_count.py
user_account_create_date.py		user_account_create_date.py
user_bio_token_count.py		user_bio_token_count.py

License

albany-social-media-analysis/QonTwitterIDDP

Folders and files

Latest commit

History

Repository files navigation

QonTwitterIDDP

Data

Code

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages