I use the Colab implementation. The style transfer seems to carry over the amplitudes, but the frequencies do not seem right. My model also only trains for about 15-30 minutes on a 3-minute source.
Everything runs without errors, but I get a lot of warnings in the section "Preprocess raw audio into TFRecord dataset" and in the section "We will now begin training. "
Maybe that's not a problem (?). Here is the output:
WARNING:tensorflow:From /tensorflow-2.1.0/python3.6/tensorflow_core/python/compat/v2_compat.py:88: disable_resource_variables (from tensorflow.python.ops.variable_scope) is deprecated and will be removed in a future version.
Instructions for updating:
non-resource variables are not supported in the long term
WARNING:tensorflow:From /usr/local/lib/python3.6/dist-packages/ddsp/training/train_util.py:189: The name tf.estimator.tpu.RunConfig is deprecated. Please use tf.compat.v1.estimator.tpu.RunConfig instead.
W0128 20:38:03.140560 139811238639488 module_wrapper.py:138] From /usr/local/lib/python3.6/dist-packages/ddsp/training/train_util.py:189: The name tf.estimator.tpu.RunConfig is deprecated. Please use tf.compat.v1.estimator.tpu.RunConfig instead.
WARNING:tensorflow:From /usr/local/lib/python3.6/dist-packages/ddsp/training/train_util.py:191: The name tf.estimator.tpu.TPUConfig is deprecated. Please use tf.compat.v1.estimator.tpu.TPUConfig instead.
W0128 20:38:03.140788 139811238639488 module_wrapper.py:138] From /usr/local/lib/python3.6/dist-packages/ddsp/training/train_util.py:191: The name tf.estimator.tpu.TPUConfig is deprecated. Please use tf.compat.v1.estimator.tpu.TPUConfig instead.
WARNING:tensorflow:From /usr/local/lib/python3.6/dist-packages/ddsp/training/train_util.py:199: The name tf.estimator.tpu.TPUEstimator is deprecated. Please use tf.compat.v1.estimator.tpu.TPUEstimator instead.
W0128 20:38:03.141086 139811238639488 module_wrapper.py:138] From /usr/local/lib/python3.6/dist-packages/ddsp/training/train_util.py:199: The name tf.estimator.tpu.TPUEstimator is deprecated. Please use tf.compat.v1.estimator.tpu.TPUEstimator instead.
INFO:tensorflow:Using config: {'_model_dir': '/content/models/ddsp-solo-instrument', '_tf_random_seed': None, '_save_summary_steps': 300, '_save_checkpoints_steps': 300, '_save_checkpoints_secs': None, '_session_config': allow_soft_placement: true
graph_options {
rewrite_options {
meta_optimizer_iterations: ONE
}
}
, '_keep_checkpoint_max': 100, '_keep_checkpoint_every_n_hours': 1, '_log_step_count_steps': None, '_train_distribute': None, '_device_fn': None, '_protocol': None, '_eval_distribute': None, '_experimental_distribute': None, '_experimental_max_worker_delay_secs': None, '_session_creation_timeout_secs': 7200, '_service': None, '_cluster_spec': ClusterSpec({}), '_task_type': 'worker', '_task_id': 0, '_global_id_in_cluster': 0, '_master': '', '_evaluation_master': '', '_is_chief': True, '_num_ps_replicas': 0, '_num_worker_replicas': 1, '_tpu_config': TPUConfig(iterations_per_loop=300, num_shards=None, num_cores_per_replica=None, per_host_input_for_training=2, tpu_job_name=None, initial_infeed_sleep_secs=None, input_partition_dims=None, eval_training_input_configuration=2, experimental_host_call_every_n_steps=1), '_cluster': None}
I0128 20:38:03.141758 139811238639488 estimator.py:216] Using config: {'_model_dir': '/content/models/ddsp-solo-instrument', '_tf_random_seed': None, '_save_summary_steps': 300, '_save_checkpoints_steps': 300, '_save_checkpoints_secs': None, '_session_config': allow_soft_placement: true
graph_options {
rewrite_options {
meta_optimizer_iterations: ONE
}
}
, '_keep_checkpoint_max': 100, '_keep_checkpoint_every_n_hours': 1, '_log_step_count_steps': None, '_train_distribute': None, '_device_fn': None, '_protocol': None, '_eval_distribute': None, '_experimental_distribute': None, '_experimental_max_worker_delay_secs': None, '_session_creation_timeout_secs': 7200, '_service': None, '_cluster_spec': ClusterSpec({}), '_task_type': 'worker', '_task_id': 0, '_global_id_in_cluster': 0, '_master': '', '_evaluation_master': '', '_is_chief': True, '_num_ps_replicas': 0, '_num_worker_replicas': 1, '_tpu_config': TPUConfig(iterations_per_loop=300, num_shards=None, num_cores_per_replica=None, per_host_input_for_training=2, tpu_job_name=None, initial_infeed_sleep_secs=None, input_partition_dims=None, eval_training_input_configuration=2, experimental_host_call_every_n_steps=1), '_cluster': None}
INFO:tensorflow:_TPUContext: eval_on_tpu False
I0128 20:38:03.142008 139811238639488 tpu_context.py:221] _TPUContext: eval_on_tpu False
WARNING:tensorflow:From /tensorflow-2.1.0/python3.6/tensorflow_core/python/ops/resource_variable_ops.py:1635: calling BaseResourceVariable.__init__ (from tensorflow.python.ops.resource_variable_ops) with constraint is deprecated and will be removed in a future version.
Instructions for updating:
If using Keras pass *_constraint arguments to layers.
W0128 20:38:03.145722 139811238639488 deprecation.py:506] From /tensorflow-2.1.0/python3.6/tensorflow_core/python/ops/resource_variable_ops.py:1635: calling BaseResourceVariable.__init__ (from tensorflow.python.ops.resource_variable_ops) with constraint is deprecated and will be removed in a future version.
Instructions for updating:
If using Keras pass *_constraint arguments to layers.
WARNING:tensorflow:From /tensorflow-2.1.0/python3.6/tensorflow_core/python/training/training_util.py:236: Variable.initialized_value (from tensorflow.python.ops.variables) is deprecated and will be removed in a future version.
Instructions for updating:
Use Variable.read_value. Variables in 2.X are initialized automatically both in eager and graph (inside tf.defun) contexts.
W0128 20:38:03.146073 139811238639488 deprecation.py:323] From /tensorflow-2.1.0/python3.6/tensorflow_core/python/training/training_util.py:236: Variable.initialized_value (from tensorflow.python.ops.variables) is deprecated and will be removed in a future version.
Instructions for updating:
Use Variable.read_value. Variables in 2.X are initialized automatically both in eager and graph (inside tf.defun) contexts.
INFO:tensorflow:Calling model_fn.
I0128 20:38:03.266683 139811238639488 estimator.py:1151] Calling model_fn.
INFO:tensorflow:Running train on CPU
I0128 20:38:03.266903 139811238639488 tpu_estimator.py:3124] Running train on CPU
I0128 20:38:04.029543 139811238639488 processors.py:138] Connecting node (additive):
I0128 20:38:04.029708 139811238639488 processors.py:140] Input 0: amps
I0128 20:38:04.029782 139811238639488 processors.py:140] Input 1: harmonic_distribution
I0128 20:38:04.029845 139811238639488 processors.py:140] Input 2: f0_hz
I0128 20:38:04.095593 139811238639488 processors.py:138] Connecting node (filtered_noise):
I0128 20:38:04.095721 139811238639488 processors.py:140] Input 0: noise_magnitudes
I0128 20:38:04.194273 139811238639488 processors.py:138] Connecting node (add):
I0128 20:38:04.194403 139811238639488 processors.py:140] Input 0: filtered_noise/signal
I0128 20:38:04.194476 139811238639488 processors.py:140] Input 1: additive/signal
I0128 20:38:04.194946 139811238639488 processors.py:138] Connecting node (reverb):
I0128 20:38:04.195056 139811238639488 processors.py:140] Input 0: add/signal
I0128 20:38:04.302336 139811238639488 processors.py:157] ProcessorGroup output node (reverb)
I0128 20:38:04.933219 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack/fc/dense/kernel:0 (shape=(1, 512), dtype=<dtype: 'float32'>).
I0128 20:38:04.933395 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack/fc/dense/bias:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.933452 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack/fc/layer_normalization/gamma:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.933498 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack/fc/layer_normalization/beta:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.933540 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack/fc_1/dense/kernel:0 (shape=(512, 512), dtype=<dtype: 'float32'>).
I0128 20:38:04.933584 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack/fc_1/dense/bias:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.933624 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack/fc_1/layer_normalization_1/gamma:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.933666 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack/fc_1/layer_normalization_1/beta:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.933704 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack/fc_2/dense/kernel:0 (shape=(512, 512), dtype=<dtype: 'float32'>).
I0128 20:38:04.933745 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack/fc_2/dense/bias:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.933784 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack/fc_2/layer_normalization_2/gamma:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.933821 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack/fc_2/layer_normalization_2/beta:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.933857 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_1/fc/dense/kernel:0 (shape=(1, 512), dtype=<dtype: 'float32'>).
I0128 20:38:04.933896 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_1/fc/dense/bias:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.934019 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_1/fc/layer_normalization_3/gamma:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.934084 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_1/fc/layer_normalization_3/beta:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.934144 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_1/fc_1/dense/kernel:0 (shape=(512, 512), dtype=<dtype: 'float32'>).
I0128 20:38:04.934210 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_1/fc_1/dense/bias:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.934275 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_1/fc_1/layer_normalization_4/gamma:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.934336 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_1/fc_1/layer_normalization_4/beta:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.934394 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_1/fc_2/dense/kernel:0 (shape=(512, 512), dtype=<dtype: 'float32'>).
I0128 20:38:04.934453 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_1/fc_2/dense/bias:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.934509 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_1/fc_2/layer_normalization_5/gamma:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.934564 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_1/fc_2/layer_normalization_5/beta:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.934619 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/gru/kernel:0 (shape=(1024, 1536), dtype=<dtype: 'float32'>).
I0128 20:38:04.934680 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/gru/recurrent_kernel:0 (shape=(512, 1536), dtype=<dtype: 'float32'>).
I0128 20:38:04.934738 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/gru/bias:0 (shape=(1536,), dtype=<dtype: 'float32'>).
I0128 20:38:04.934794 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_2/fc/dense/kernel:0 (shape=(1536, 512), dtype=<dtype: 'float32'>).
I0128 20:38:04.934854 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_2/fc/dense/bias:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.934910 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_2/fc/layer_normalization_6/gamma:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.934981 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_2/fc/layer_normalization_6/beta:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.935039 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_2/fc_1/dense/kernel:0 (shape=(512, 512), dtype=<dtype: 'float32'>).
I0128 20:38:04.935099 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_2/fc_1/dense/bias:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.935154 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_2/fc_1/layer_normalization_7/gamma:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.935209 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_2/fc_1/layer_normalization_7/beta:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.935270 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_2/fc_2/dense/kernel:0 (shape=(512, 512), dtype=<dtype: 'float32'>).
I0128 20:38:04.935333 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_2/fc_2/dense/bias:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.935391 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_2/fc_2/layer_normalization_8/gamma:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.935447 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/fc_stack_2/fc_2/layer_normalization_8/beta:0 (shape=(512,), dtype=<dtype: 'float32'>).
I0128 20:38:04.935502 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/dense/kernel:0 (shape=(512, 126), dtype=<dtype: 'float32'>).
I0128 20:38:04.935561 139811238639488 models.py:230] adding trainable variable rnn_fc_decoder/dense/bias:0 (shape=(126,), dtype=<dtype: 'float32'>).
I0128 20:38:04.935617 139811238639488 models.py:230] adding trainable variable ir:0 (shape=(48000,), dtype=<dtype: 'float32'>).
WARNING:tensorflow:From /usr/local/lib/python3.6/dist-packages/ddsp/training/train_util.py:126: The name tf.estimator.tpu.TPUEstimatorSpec is deprecated. Please use tf.compat.v1.estimator.tpu.TPUEstimatorSpec instead.
W0128 20:38:06.495219 139811238639488 module_wrapper.py:138] From /usr/local/lib/python3.6/dist-packages/ddsp/training/train_util.py:126: The name tf.estimator.tpu.TPUEstimatorSpec is deprecated. Please use tf.compat.v1.estimator.tpu.TPUEstimatorSpec instead.
INFO:tensorflow:Done calling model_fn.
I0128 20:38:06.514110 139811238639488 estimator.py:1153] Done calling model_fn.
INFO:tensorflow:Create CheckpointSaverHook.
I0128 20:38:06.515020 139811238639488 basic_session_run_hooks.py:546] Create CheckpointSaverHook.
INFO:tensorflow:Graph was finalized.
I0128 20:38:07.337075 139811238639488 monitored_session.py:246] Graph was finalized.
2020-01-28 20:38:07.447110: W tensorflow/core/common_runtime/gpu/gpu_bfc_allocator.cc:39] Overriding allow_growth setting because the TF_FORCE_GPU_ALLOW_GROWTH environment variable is set. Original config value was 0.
INFO:tensorflow:Running local_init_op.
I0128 20:38:08.580479 139811238639488 session_manager.py:504] Running local_init_op.
INFO:tensorflow:Done running local_init_op.
I0128 20:38:08.617971 139811238639488 session_manager.py:507] Done running local_init_op.
INFO:tensorflow:Saving checkpoints for 0 into /content/models/ddsp-solo-instrument/model.ckpt.
I0128 20:38:10.710745 139811238639488 basic_session_run_hooks.py:613] Saving checkpoints for 0 into /content/models/ddsp-solo-instrument/model.ckpt.
INFO:tensorflow:global_step/sec: 0.166465
I0128 20:38:26.712445 139811238639488 tpu_estimator.py:2307] global_step/sec: 0.166465
INFO:tensorflow:examples/sec: 2.66345
I0128 20:38:26.713377 139811238639488 tpu_estimator.py:2308] examples/sec: 2.66345
INFO:tensorflow:global_step/sec: 0.556173
I0128 20:38:28.510447 139811238639488 tpu_estimator.py:2307] global_step/sec: 0.556173
INFO:tensorflow:examples/sec: 8.89877
I0128 20:38:28.510792 139811238639488 tpu_estimator.py:2308] examples/sec: 8.89877
INFO:tensorflow:global_step/sec: 0.587273
I0128 20:38:30.213216 139811238639488 tpu_estimator.py:2307] global_step/sec: 0.587273
...
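Most of the output above is hard to read because each message appears to be printed twice (once with a `WARNING:tensorflow:` / `INFO:tensorflow:` prefix and once with an absl-style `W0128 ...]` prefix) and is mixed with deprecation notices. A minimal sketch for condensing a pasted log like this one; the helper names `message` and `condense` are hypothetical, the regular expressions are only matched against the line shapes seen above, and multi-line messages (such as the `Using config:` dump or the "Instructions for updating:" follow-up lines) are not handled:

```python
import re

# absl-style prefix, e.g.
# "I0128 20:38:03.266683 139811238639488 estimator.py:1151] "
ABSL_PREFIX = re.compile(r"^[IWEF]\d{4} \d{2}:\d{2}:\d{2}\.\d+ +\d+ \S+:\d+\] ")
# Python-logging-style prefix, e.g. "INFO:tensorflow:"
PY_PREFIX = re.compile(r"^(INFO|WARNING|ERROR):tensorflow:")


def message(line):
    """Strip either logger prefix so both copies of a message compare equal."""
    return PY_PREFIX.sub("", ABSL_PREFIX.sub("", line))


def condense(lines):
    """Drop deprecation notices and consecutive repeats of the same message."""
    kept, previous = [], None
    for line in lines:
        msg = message(line)
        if "deprecated" in msg:
            continue  # skip the notice itself; 'previous' is left unchanged
        if msg != previous:
            kept.append(msg)
        previous = msg
    return kept


if __name__ == "__main__":
    sample = [
        "INFO:tensorflow:Calling model_fn.",
        "I0128 20:38:03.266683 139811238639488 estimator.py:1151] Calling model_fn.",
        "INFO:tensorflow:Running train on CPU",
        "I0128 20:38:03.266903 139811238639488 tpu_estimator.py:3124] Running train on CPU",
    ]
    for msg in condense(sample):
        print(msg)
```

This only post-processes text that has already been printed; it does not change what TensorFlow logs during training.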
When I upload the model in the style transfer Colab, the resynthesized sample sounds like rhythmic noise / wind.
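For reference, here is a rough estimate of how many training steps 15-30 minutes amounts to. It takes the `global_step/sec` and `examples/sec` values from the log at face value and assumes the throughput stays constant for the whole run, which may not hold. The function name `estimated_steps` is just for illustration.

```python
# Values copied from the log above; everything else is derived from them.
STEPS_PER_SEC = (0.556173, 0.587273)  # the two steady-state global_step/sec readings
EXAMPLES_PER_SEC = 8.89877            # examples/sec reported alongside 0.556173


def estimated_steps(minutes, steps_per_sec):
    """Training steps completed in `minutes`, assuming constant throughput."""
    return round(minutes * 60 * steps_per_sec)


# examples/sec divided by steps/sec gives the number of examples per step.
batch_size = round(EXAMPLES_PER_SEC / STEPS_PER_SEC[0])

low = estimated_steps(15, min(STEPS_PER_SEC))
high = estimated_steps(30, max(STEPS_PER_SEC))

print(f"batch size ~{batch_size}, steps in 15-30 min: ~{low} to ~{high}")
```

Under these assumptions the run covers roughly 500 to 1,060 steps at a batch size of about 16, so the short training time might be part of the problem.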