Comments (2)

clarahohohoho commented on September 24, 2024

Hi, I found that in the write_records_tcd.py step, the write_bmp_records() function in the dataset_write.py script has the variable append_aus defaulting to False. I tried changing it to append_aus=True, but another issue surfaced: ValueError: 'AU25_r' is not in list. This is probably because the headers in my csv files are limited to ['frame,face_id,timestamp,confidence,success,pose_Tx,pose_Ty,pose_Tz,pose_Rx,pose_Ry,pose_Rz']. Any idea how I can proceed? Thank you!

from avsr-tf1.

georgesterpu commented on September 24, 2024

Hi @clarahohohoho
aus stands for facial action units (AUs). We proposed regressing AUs from the video representations jointly with the speech decoding task, in order to overcome a learning issue of AV Align (where the audio encoder attends to the video encoder) seen on a more challenging task than speaker-dependent TCD-TIMIT.

If you are running an experiment using the run_audiovisual.py script, please note the following parameter: regress_aus=True. When this flag is enabled, it is expected that the tfrecord file contains a sequence of action unit intensities, allowing the computation of the distance between these ground truth values and the network's prediction. You may set this flag to False, depending on the goals of your research.

To generate target values for the AU intensities, we used the OpenFace toolkit. The extract_faces.py script is a wrapper that calls the OpenFace binaries from Python and generates the bmp and csv files in the format expected by the code in this repository. The Action Units are written to the csv when the -aus flag is appended; please see the complete set of CLI arguments for details. I realise now that the -aus flag is not used in the example pre-processing script, while regress_aus is set to True in the AV experiment launch script, so I'll correct this issue.

You may need to pre-process the video files again with the -aus flag set in extract_faces.py, then re-generate the tfrecords. For convenience, I stored a single set of tfrecord files with this metadata appended, and only enabled or disabled AUs at runtime.
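Before re-generating the tfrecords, it may be worth checking that a csv really contains AU intensity columns. Below is a minimal sketch, not code from this repository: au_intensity_columns is a hypothetical helper, the second header is a made-up example, and the assumption is that intensity columns follow the 'AU25_r' naming seen in the error message.

```python
import csv
import io


def au_intensity_columns(csv_file):
    """Return the AU intensity column names (e.g. 'AU25_r') found in the header row."""
    header = next(csv.reader(csv_file))
    # Header names may carry padding spaces after the commas, so strip each one.
    names = [name.strip() for name in header]
    # Assumption: intensity columns start with 'AU' and end with '_r'.
    return [n for n in names if n.startswith("AU") and n.endswith("_r")]


# Header reported in the question: pose columns only, no AUs.
without_aus = io.StringIO(
    "frame,face_id,timestamp,confidence,success,"
    "pose_Tx,pose_Ty,pose_Tz,pose_Rx,pose_Ry,pose_Rz\n"
)
# Hypothetical header of a file written with AU output enabled.
with_aus = io.StringIO("frame, face_id, timestamp, confidence, success, AU01_r, AU25_r, AU25_c\n")

print(au_intensity_columns(without_aus))  # []
print(au_intensity_columns(with_aus))     # ['AU01_r', 'AU25_r']
```

For a file on disk, open it with open(path, newline="") and pass the handle to the helper. An empty result means that looking up 'AU25_r' in the header list will raise the ValueError reported above.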

I hope this helps, please let me know if there is something else to clarify.
