Git Product home page Git Product logo

mahatkc / canliesbefaked Goto Github PK

View Code? Open in Web Editor NEW
2.0 1.0 0.0 3.38 MB

Repository for the paper "Can lies be faked? Comparing low-stakes and high-stakes deception video datasets from a Machine Learning perspective"

Home Page: https://www.sciencedirect.com/science/article/pii/S0957417424005505

Python 100.00%
computer-vision deception-detection deep-learning machine-learning video-detection

canliesbefaked's Introduction

Can lies be faked?

This repository was created for my BSc Thesis (or Monograph, or Final Year Project, or "Trabalho de Conclusão de Curso", depending on how you might call it): "Video-based Deception Detection with Deep Learning". You can read the whole Thesis here (in Portuguese).

However, the project was developed further and a paper has been written based on the Thesis: "Can lies be faked? Comparing low-stakes and high-stakes deception video datasets from a Machine Learning perspective", published in Expert Systems with Applications.

This repository is organized as follows: Code folder has the code used to run experiments and has instructions on how to organize the datasets so experiments run properly; Results folder has the CSV files with the results of each of the runs performed during the hyperparameter search phase, as well as TXT files with detailed results from 5-Fold testing.


Paper Abstract

Despite the great impact of lies in human societies and a meager 54% human accuracy for Deception Detection (DD), Machine Learning systems that perform automated DD are still not viable for proper application in real-life settings due to data scarcity. Few publicly available DD datasets exist and the creation of new datasets is hindered by the conceptual distinction between low-stakes and high-stakes lies. Theoretically, the two kinds of lies are so distinct that a dataset of one kind could not be used for applications for the other kind. Even though it is easier to acquire data on low-stakes deception since it can be simulated (faked) in controlled settings, these lies do not hold the same significance or depth as genuine high-stakes lies, which are much harder to obtain and hold the practical interest of automated DD systems. To investigate whether this distinction holds true from a practical perspective, we design several experiments comparing a high-stakes DD dataset and a low-stakes DD dataset evaluating their results on a Deep Learning classifier working exclusively from video data. In our experiments, a network trained in low-stakes lies had better accuracy classifying high-stakes deception than low-stakes, although using low-stakes lies as an augmentation strategy for the high-stakes dataset decreased its accuracy.


Thesis Abstract

Deception Detection is a task in which humans show an outstanding difficulty, reaching an accuracy of only 54% according to the literature. Despite that, this task becomes significantly relevant in contexts such as trials, interviews and criminal investigations, in which the impact of mistakenly classifying a discourse as deceptive or truthful may be catastrophic. In this sense, lies and their identification have spiked the interest of researchers for centuries, with the creation of devices destined to aid in deception detection and, more recently, with the development of Machine Learning and Deep Learning systems capable of properly classifying them. Based on these systems, this work analyzes literature on the subject and implements a Deep Neural Network based on the SlowFast architecture using the Real-Life Trial dataset reaching an accuracy of 66.36%. Special consideration is given to ethical issues and limitations concerning the use of Machine Learning and Deep Learning systems that perform deception detection, with the recommendation that these systems not be used in real-life situations given the existing limitations for their satisfactory development.


canliesbefaked's People

Contributors

mahatkc avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.