Comments (2)
Hi, I found out that in the write_records_tcd.py step, the function write_bmp_records()
function in the dataset_write.py script the variable append_aus
is defaulted to False. I have tried changing it to append_aus=True
however another issue surfaced, where ValueError: 'AU25_r' is not in list
. This is probably because in my csv files, the headers are only limited to ['frame,face_id,timestamp,confidence,success,pose_Tx,pose_Ty,pose_Tz,pose_Rx,pose_Ry,pose_Rz']
. Any idea how can I proceed? Thank you!
from avsr-tf1.
Hi @clarahohohoho
aus
stands for facial action units. We proposed to regress AUs from video representations jointly with the speech decoding task in order to overcome a learning issue of AV Align (audio encoder attends to video encoder) seen on a more challenging task than speaker-dependent TCD-TIMIT.
If you are running an experiment using the run_audiovisual.py
script, please note the following parameter: regress_aus=True
. When this flag is enabled, it is expected that the tfrecord file contains a sequence of action unit intensities, allowing the computation of the distance between these ground truth values and the network's prediction. You may set this flag to False, depending on the goals of your research.
To generate target values for the AU intensities, we used the OpenFace toolkit. The extract_faces.py
script is a wrapper that calls the OpenFace binaries from Python and generates the bmp and csv files in the format expected by the code in this repository. The Action Units are written to the csv by appending the -aus
flag, please see here the complete set of CLI arguments. I realise now that the -aus
flag is not used in the example pre-processing script, but regress_aus
is set to True
in the AV experiment launch script, so I'll correct this issue.
You may need to pre-process again the video files setting the -aus
flag in extract_faces.py
, then re-generate the tfrecords. For convenience, I stored a single set of tfrecord files appended with this metadata, and only enabled or disabled AUs at runtime.
I hope this helps, please let me know if there is something else to clarify.
from avsr-tf1.
Related Issues (20)
- Inquiry about some parameter selection reason HOT 5
- awgn: out of bounds when sampling noise clip HOT 1
- [AMSGrad] get running error in audio only training HOT 3
- How do I change loss function to CTC?? HOT 2
- [feature] minimum data length for stack log mel feature HOT 2
- visemes and phonemes mapping HOT 1
- How many epochs should i train? HOT 2
- How can i change this av_align model for applying to audio2video?
- What should I do to reproduce the results of the paper? HOT 5
- How can I solve this problem HOT 6
- Inquiry about aus csv generation HOT 2
- What do the folders speaker-dependence and speaker-independence stand for? HOT 2
- How to run this program on multiple GPUs HOT 5
- The tfrecord files HOT 8
- use video-only on LRS2 HOT 1
- The result in noisy environment HOT 6
- how to pad features and labels to same length in one batch ? HOT 1
- Ask about epochs and learning rate HOT 1
- ask about the results HOT 9
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from avsr-tf1.