Git Product home page Git Product logo

ph-vaccinebrand-discourse's Introduction

2021 Philippine Vaccine Brand Preference Discourse on Twitter

This repository contains the files that were used in the ongoing undergraduate research of Martha Ysaac entitled, "A Corpus-Assisted Critical Discourse Analysis of Tweets on the COVID-19 Vaccine Brand Discourse in the Philippines". Visuals of the data that were presented in the research can be found here.

Information about the Data

  • Academic access was required to extract Tweets from the year 2021. Details on this can be read here.
  • The Tweets that were collected were from March 1, 2021 up until December 31, 2021.
  • Five vaccine brands were chosen as the keywords for extracting Tweets related to the PH COVID-19 vaccine brand Twitter discourse: Sinovac, Pfizer, Moderna, Astrazeneca, and Sputnik.
  • Tagalog was the chosen language for the language parameter in the query.
  • Details on building a query can be found here.

Python Files

  1. vacbrand_twitter - Program for scraping Tweets using tweepy and the Twitter API
  2. vacbrand_find_string - Program for finding a specific substring from the text

Text Files

  1. alltweets_sorted - Tweets from all vaccine brands sorted alphabetically
  2. alltweets_sorted_filtered - Tweets from alltweets_sorted with punctuations removed and all letters in smallcaps
  3. smallcaps_sinovac - Tweets containing "sinovac" with punctuations removed and all letters in smallcaps
  4. smallcaps_pfizer - Tweets containing "pfizer" with punctuations removed and all letters in smallcaps
  5. smallcaps_moderna - Tweets containing "moderna" with punctuations removed and all letters in smallcaps
  6. smallcaps_astrazeneca - Tweets containing "astrazeneca" with punctuations removed and all letters in smallcaps
  7. smallcaps_sputnik - Tweets containing "sputnik" with punctuations removed and all letters in smallcaps

CSV Files

  1. vacbrand_tweetnumber - Table containing the number of Tweets collected per vaccine brand and month
  2. alltweets_mostusedwords - Table containing the top 25 most frequently used words in alltweets_sorted.txt
  3. pronoun_frequency - Table containing frequency of all pronouns
  4. ko_collocations - Table containing the top 10 collocations of the singular Tagalog pronoun "ko" ("me" or "my" in English) and their frequencies
  5. officialmentions_frequency - Table containing the Twitter user accounts relevant to the Philippine COVID-19 vaccine brand discourse on Twitter and their frequencies

ph-vaccinebrand-discourse's People

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.