dsvilarkovic / clickbaitonator

Continuous conditional text generation of clickbait newspaper articles, using a fine-tuned Pegasus text summarizer with the Fudge and PPLM conditional text generation models.

Home Page: https://huggingface.co/spaces/EuroPython2022/clickbaitonator

License: GNU General Public License v3.0

Languages: Jupyter Notebook 94.10%, Python 4.50%, Shell 0.01%, JavaScript 1.41%

Topics: clickbait, pegasus, text-summarization

clickbaitonator

An NLP & LSS 2022 project that aims to generate conditional clickbait titles based on article content.
Check out the Huggingface Spaces app at: https://huggingface.co/spaces/Dusan/clickbaitonator

Files used for running models

ClickbaitClassifier.ipynb - notebook for training the clickbait classifier.
Fudge_running.ipynb - Fudge-based implementation for generating controllable clickbait titles. It contains the setup for generating titles, rating them with our clickbait classifier and saving the results in the respective dataframes. Our Huggingface Spaces implementation uses this model because it responds in real time. The end of the notebook also has a Gradio section where you can launch an in-notebook Gradio application.
PPLM_running.ipynb - PPLM-based implementation for generating controllable clickbait titles. It contains the setup for generating titles, rating them with our clickbait classifier and saving the results in the respective dataframes.
DataSet_sentence_collector.ipynb - sentence collector that runs on the Fudge and PPLM dataframes generated by Fudge_running.ipynb and PPLM_running.ipynb. It was used to sample the best generated titles.
DatasetAnalysis.ipynb - notebook for running word cloud and LDA topic modelling analysis on the ground truth titles.
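
The generate, rate, save and sample flow described above can be sketched roughly as follows. This is a minimal illustration, not the notebooks' code: `toy_clickbait_score` is a hypothetical keyword heuristic standing in for the trained classifier, the candidate titles are made up, and plain lists of dicts replace the dataframes so the sketch needs only the standard library.

```python
import csv
import io

# Hypothetical stand-in for the trained clickbait classifier:
# the fraction of a small set of trigger words present in the title.
TRIGGER_WORDS = {"shocking", "secret", "you", "never", "believe"}

def toy_clickbait_score(title: str) -> float:
    words = {w.strip(".,!?'\"").lower() for w in title.split()}
    return len(words & TRIGGER_WORDS) / len(TRIGGER_WORDS)

def rate_titles(titles, model_name):
    """Attach a score to each generated title (one row per title)."""
    return [
        {"model": model_name, "title": t, "score": toy_clickbait_score(t)}
        for t in titles
    ]

def best_titles(rows, k=1):
    """Keep the k highest-scoring rows, mirroring the 'sample best titles' step."""
    return sorted(rows, key=lambda r: r["score"], reverse=True)[:k]

def to_csv(rows):
    """Serialize rows to CSV text, as a stand-in for saving a dataframe."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=["model", "title", "score"])
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()

# Made-up candidate titles, for illustration only.
candidates = [
    "City council approves new budget",
    "You will never believe this shocking budget secret",
]
rows = rate_titles(candidates, model_name="fudge")
top = best_titles(rows, k=1)
csv_text = to_csv(rows)
```

In the real notebooks the scoring function would be the trained classifier and the rows would live in dataframes; only the overall shape of the flow is meant to carry over.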

Folders used for running analysis and models

DataProcessing - used for generating all relevant tables in the report and presentation. Contains the fudge_csv, PPLM_csv and pegasus_clickbait_summarizer_csv folders, which hold the .csv files used for the report and for the visualizations of the respective models.
fudge - Fudge adapted to our clickbait summarizer.
PPLM - PPLM adapted to our clickbait summarizer.
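
For orientation, the core idea behind Fudge-style control is to reweight the base generator's next-token distribution by an attribute predictor's estimate that the continuation will have the desired attribute. The sketch below shows that reweighting step in isolation. It is an illustration under stated assumptions rather than the code in the fudge folder: the token probabilities are made up, and `fudge_reweight` and `strength` are hypothetical names.

```python
import math

def fudge_reweight(base_probs, attr_probs, strength=1.0):
    """Combine base next-token probabilities with attribute probabilities.

    base_probs: token -> P(token | prefix) from the generator.
    attr_probs: token -> P(attribute | prefix + token) from the predictor.
    strength:   hypothetical scaling factor on the attribute log-probability.
    Assumes all probabilities are strictly positive.
    Returns a normalized distribution proportional to
    base_probs[t] * attr_probs[t] ** strength.
    """
    log_scores = {
        tok: math.log(base_probs[tok]) + strength * math.log(attr_probs[tok])
        for tok in base_probs
    }
    # Normalize with the log-sum-exp trick for numerical stability.
    max_log = max(log_scores.values())
    total = sum(math.exp(s - max_log) for s in log_scores.values())
    return {tok: math.exp(s - max_log) / total for tok, s in log_scores.items()}

# Made-up numbers, for illustration only.
base = {"budget": 0.6, "secret": 0.3, "shocking": 0.1}
attr = {"budget": 0.1, "secret": 0.8, "shocking": 0.9}
controlled = fudge_reweight(base, attr)
```

With these toy numbers the generator alone prefers "budget", while the reweighted distribution prefers "secret"; setting `strength` to 0 recovers the base distribution.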
