Git Product home page Git Product logo

bi-ner's Introduction

Streamlit app to Perform NER on Wikipedia text

The requirment of the project are as follows :-

  • Scrap data using wikipedia API to perform NER
  • Perform Named Entity Recognition on scrapped data and extract entities like city, person, organisation, Date, Geographical Entity, Product etc.
  • Display annotated text in Streamlit App.

Language Used

  • Python

Library, Packaged and API Used

  • Wikipedia (API)
  • Streamlit (Library)
  • Spacy_streamlit (Package)
  • Spacy (Library)

Steps to access the live project

  • Go to url (https://afternoon-shore-15753.herokuapp.com/) image

  • Enter any keyword in the textbox area on which you want to perform the NER image

  • And just move your cursor out from the text area

  • If entered keyword matches with any wikipedia page title you will see the output below. image

How to install the project

Open VScode and open a terminal inside it and run the following steps

  1. Clone this repository using the code below.

    • git clone https://github.com/aman2457/bi-ner.git
  2. Install the required package and libraries using command.

    • pip install -r requirements.txt
  3. Now run the below command in cli to open the app.

    • streamlit run app.py

Some information regarding the app.

  1. The app fetch the text from wikipedia which matches with the user's keyword. If multiple pages found with same keyword then a random page is choosen.
  2. The extracted text then loaded into a NLP model which peform NER.
  3. After the NER the ouptut is feeded into a spacy_streamlit.visualize_ner funtion of streamlit_spacy which visualize the text based on NER.

Caution :-

  1. If you are not getting output on the given keyword there may be folliwing reason:-
    • Server get time out
    • No Wikipedia page matched with keyword
    • A page title unexpectedly resolves to a redirect

bi-ner's People

Contributors

aman2457 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.