Git Product home page Git Product logo

Comments (4)

renospec avatar renospec commented on June 18, 2024

Perhaps Name parsing/Training is not a well embraced function of parserator.
I have tested Probable People and was hoping for better accuracy in training with Name formats in Parserator.

from parserator.

jeancochrane avatar jeancochrane commented on June 18, 2024

Hey @renospec,

Thanks for waiting on this. Can you give me a better sense of what you're trying to do? If you're just looking to alter/improve the behavior of name parsing, you might find it easier to develop off of the probablepeople library and retrain the model yourself. We don't have probablepeople-specific docs for this yet, but the guide for usaddress should be nearly identical.

from parserator.

renospec avatar renospec commented on June 18, 2024

Thank You
I have developed with the ProbablePeople Library and have trained the model for my data.
After the training session, I started testing different name formats and found that name formats that were not trained for were not successfully parsed. So that means that I have to train all variations of name formats....!

from parserator.

jeancochrane avatar jeancochrane commented on June 18, 2024

Thanks for your patience on this @renospec! It's been a busy week on my end.

When you were developing probablepeople, did you train it on the canonical training data in addition to your new data? As per the Building & Testing the Code section of the docs, that command should look like this:

 parserator train name_data/labeled/labeled.xml,name_data/labeled/company_labeled.xml probablepeople

That's all I can think of off the top of my head that might be causing your error here. If you did that correctly, then the next step will be for me to take a look at your new training data and the name formats you're testing to see if I can reproduce your error. Are you comfortable sharing that data?

from parserator.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.