Comments (15)

Ironholds commented on August 20, 2024

@statspro1 this should now be fixed; if you run:

devtools::install_github("ropensci/tokenizers")

before installing tidytext, you'll have a version which works swimmingly on CentOS with older GCC versions. It'll be in the next CRAN release, too, but in the meantime grabbing that developer version is the solution, and this bug is officially fixed!

from tidytext.

Ironholds commented on August 20, 2024

Just a note - I think that this is caused by the dependency tokenizers has on C++11. It may be possible to switch the code to C++98, which would widen the range of compilers it can be built with. I've extended an offer to Lincoln to do that, since it's too much like makework for him to really be enthusiastic about but would help clear a couple of open issues and expand the usability of tokenizers and its dependent packages.
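
To illustrate the kind of change a C++11 to C++98 switch involves, here is a minimal sketch. It is not taken from the tokenizers source; the function name join_words_cpp98 and its behaviour are hypothetical, chosen only to show one common C++11 construct (the range-based for loop) rewritten so that a compiler without C++11 support can build it.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper, NOT from tokenizers: joins words with a separator.
//
// A C++11 version might iterate like this:
//     for (const auto& w : words) { ... }
// Compilers without C++11 support reject the ':' in that loop header.
// The C++98 version below uses a plain index loop instead.
std::string join_words_cpp98(const std::vector<std::string>& words,
                             const std::string& sep) {
    std::string out;
    for (std::size_t i = 0; i < words.size(); ++i) {
        if (i > 0) {
            out += sep;  // separator goes between words, not before the first
        }
        out += words[i];
    }
    return out;
}
```

The behaviour is the same either way; only the loop syntax changes, which is why this kind of port tends to be tedious rather than difficult.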

juliasilge commented on August 20, 2024

@Ironholds Thank you SO MUCH for your work in getting this installation issue hammered out. ⭐⭐⭐

juliasilge commented on August 20, 2024

Hmmmm, looks like the tokenizers package (a dependency of tidytext) is not installing correctly for you.
https://github.com/ropensci/tokenizers
I imagine this is related to how the C++ code is being compiled (your C compiler?) but I will admit this is not my area of expertise. I am going to point you to the tokenizers repo and see if they can help.

statspro1 commented on August 20, 2024

Thanks for the feedback. I already opened an issue on the package page and upgraded CentOS and GCC, but the compilation is still failing for that package.

cat /etc/centos-release
CentOS release 6.8 (Final)

gcc --version
gcc (GCC) 4.8.2 20140120 (Red Hat 4.8.2-15)

I am wondering if other folks are having the same issue.

juliasilge commented on August 20, 2024

I haven't heard of anybody else running into this issue at this point, I'm afraid.

mo58 commented on August 20, 2024

Same error for me. I'm not sure if it is because of gcc version 4.8, but it probably is.
Does anyone have a solution?

mo58 commented on August 20, 2024

Tried it with gcc 4.4.7. Still not working.

juliasilge commented on August 20, 2024

Have you tried getting some help over at the tokenizers package?
https://github.com/ropensci/tokenizers
The tokenizers package is a dependency of tidytext, and does use compiled C++ code. You are on Red Hat Linux like the original poster?

mo58 commented on August 20, 2024

Yep, I have already asked the question there. All versions (R, gcc, binutils, etc.) are up to date. Still searching for what is causing the error.

anglax commented on August 20, 2024

Hello, I'm getting the same issue. Has anyone come up with a solution?

wooopenr commented on August 20, 2024

I am having the same issue, with the errors below. I am using Linux 6.8 with gcc-c++ 4.4.

shingle_ngrams.cpp:28: error: expected initializer before ‘:’ token
shingle_ngrams.cpp:35: error: expected primary-expression before ‘ngram_out_len’
shingle_ngrams.cpp:35: error: expected ‘)’ before ‘ngram_out_len’
shingle_ngrams.cpp:35: error: ‘ngram_out_len’ was not declared in this scope
shingle_ngrams.cpp:36: error: ‘ngram_out_len’ was not declared in this scope

shingle_ngrams.cpp:44: error: ‘len’ was not declared in this scope

juliasilge commented on August 20, 2024

Yep, those are errors from the tokenizers package. I see that you have asked over there on an issue in that repo, which is the right way to go. You might also show them the exact errors you are getting, like you did here.

Ironholds commented on August 20, 2024

Noo problem! Although now it looks like I'm clearing tokenizers' bug backlog too. The reward for a job well done, etc, etc ;)

github-actions commented on August 20, 2024

This issue has been automatically locked. If you believe you have found a related problem, please file a new issue (with a reprex: https://reprex.tidyverse.org) and link to this issue.
