technical report

a longitudinal Twitter sentiment dataset collected using this tool, together with its analysis, can be found here.

files

methods.py

contains:

Criterion: a criterion for judging whether a single text sentence falls into any of the possible categories.
- KeywordCriterion: a type of Criterion that uses keywords. keywords can be defined using a dict.
DistantSupervisor: applies a Criterion to an input file and categorises all tweets in that file, creating a new file for each category. optionally, it also records the evidence on which each tweet was categorised.
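
The idea of a keyword-based criterion can be illustrated with a minimal sketch. This is not the code in methods.py: the class name `KeywordCriterionSketch`, the `evidence` method, the `categorise` helper and the example keywords are all hypothetical, and the sketch keeps results in memory instead of writing one file per category.

```python
from typing import Dict, List, Tuple


class KeywordCriterionSketch:
    """Hypothetical stand-in for a keyword-based criterion (not the repo's class)."""

    def __init__(self, keywords: Dict[str, List[str]]):
        # Maps each category name to the keywords that count as evidence for it.
        self.keywords = keywords

    def evidence(self, text: str) -> Dict[str, List[str]]:
        # For every category, list the keywords that occur in the text.
        found = {}
        for category, words in self.keywords.items():
            hits = [w for w in words if w in text]
            if hits:
                found[category] = hits
        return found


def categorise(texts, criterion) -> Dict[str, List[Tuple[str, List[str]]]]:
    """Group texts by category, keeping the matched keywords as evidence."""
    by_category = {}
    for text in texts:
        for category, hits in criterion.evidence(text).items():
            by_category.setdefault(category, []).append((text, hits))
    return by_category


# Assumed example keywords, chosen only for illustration.
criterion = KeywordCriterionSketch({"weather": ["rain", "sunny"], "transport": ["bus", "train"]})
result = categorise(["rain again today", "the bus was late", "nothing to see"], criterion)
```

A text that matches no keyword ends up in no category, and a text that matches keywords from several categories is listed under each of them. Whether the real DistantSupervisor treats such cases the same way is not stated here.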

extract_tweets_by_keywords.py

example script for using a KeywordCriterion to categorise tweets into positive and negative based on emoticons and emojis.
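
A rough sketch of this kind of script, under stated assumptions, might look as follows. The emoticon and emoji lists, the function `split_by_keywords`, the one-tweet-per-line input format and the `<category>.txt` output names are all hypothetical and may differ from what extract_tweets_by_keywords.py actually does.

```python
import tempfile
from pathlib import Path

# Hypothetical keyword dict; the repository's actual emoticon and emoji lists may differ.
KEYWORDS = {
    "positive": [":)", ":-)", "😀"],
    "negative": [":(", ":-(", "😢"],
}


def split_by_keywords(input_path: Path, output_dir: Path) -> dict:
    """Write each tweet (one per line) to <category>.txt for every category it matches."""
    counts = {category: 0 for category in KEYWORDS}
    handles = {c: (output_dir / f"{c}.txt").open("w", encoding="utf-8") for c in KEYWORDS}
    try:
        with input_path.open(encoding="utf-8") as tweets:
            for line in tweets:
                tweet = line.rstrip("\n")
                for category, words in KEYWORDS.items():
                    if any(word in tweet for word in words):
                        handles[category].write(tweet + "\n")
                        counts[category] += 1
    finally:
        # Close every output file even if reading the input fails.
        for handle in handles.values():
            handle.close()
    return counts


# Demonstration on a small temporary file instead of a real tweet dump.
with tempfile.TemporaryDirectory() as tmp:
    tmp_dir = Path(tmp)
    source = tmp_dir / "tweets.txt"
    source.write_text("love this :)\nso tired 😢\njust a tweet\n", encoding="utf-8")
    counts = split_by_keywords(source, tmp_dir)
    positive_lines = (tmp_dir / "positive.txt").read_text(encoding="utf-8").splitlines()
```

In this sketch a tweet containing both a positive and a negative keyword is written to both files; a real pipeline may prefer to discard such ambiguous tweets.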

analyse_keywords.py

example script for counting how many times each keyword was used as evidence to categorise a tweet (if any) and how many keywords were found in each categorised tweet.
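
Both counts can be sketched with `collections.Counter`. The evidence records below are made-up illustration data, and the variable names are assumptions; the format that analyse_keywords.py actually reads is not described here.

```python
from collections import Counter

# Hypothetical evidence records: one list of matched keywords per categorised tweet.
evidence_per_tweet = [[":)"], [":)", ":D"], [":("], [":)"]]

# How many times each keyword was used as evidence across all tweets.
keyword_counts = Counter(k for evidence in evidence_per_tweet for k in evidence)

# How many tweets contained one keyword, two keywords, and so on.
keywords_per_tweet = Counter(len(evidence) for evidence in evidence_per_tweet)
```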

wenjie-yin/distant-supervision-tweets
