Git Product home page Git Product logo

Comments (2)

Gilles-Narcy avatar Gilles-Narcy commented on August 14, 2024 1

I created the wordclouds using a world cloud generator online (https://www.freewordcloudgenerator.com/) following Terry's advice. I extracted the texts from the tc dataset after cleaning the data from GitHub on Excel. Maybe some text was lost in the process - I'll double-check. I'll take your suggestion about diversity scores converted into bar charts - maybe with different colors for each tag in the manuscript, in order to provide another visualization of the languages-tags correlation.

I'll cite Roni and Clement of course, and make sure to discuss my methodology in my paper. Thank you again for your precious help!

from sandbox-projects.

njr2128 avatar njr2128 commented on August 14, 2024

We just took a look:

  • can you describe how you created these wordclouds?
  • How are you populating the data? We noticed that not all terms/phrases were represented and some seemed cut off
  • What app are you using to generate the clouds?
  • do the colors of the words have significance/meaning?
  • perhaps it is better to generate a vocabulary diversity score for each language and plot it as a bar chart or scattergram?
    --> these questions of methodology (ie how you made these charts) should be included in this repo but also in your final paper

It would be great to use what Roni and Clement have already generated if they are of use to you and your argument. Just remember to cite them

from sandbox-projects.

Related Issues (3)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.