Comments (22)

yundiqian commented on May 2, 2024

I see, that is possible: Fuchsia code may be quite different from the v8 code, so the model trained on Fuchsia does not work well on v8.

mtrofin commented on May 2, 2024

Yes, when training your own model, disable lto, and (of course) make sure you're passing -Oz to clang.
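
For illustration, a minimal sketch of what that can look like for a CMake-based project (the project path and the way your build passes flags are assumptions; the relevant part is keeping -Oz and dropping -flto):

    # Hypothetical configuration of the project used for training:
    # keep -Oz for code size, and make sure -flto is NOT among the flags.
    cmake -G Ninja \
      -DCMAKE_C_COMPILER=/path/to/clang \
      -DCMAKE_CXX_COMPILER=/path/to/clang++ \
      -DCMAKE_C_FLAGS="-Oz" \
      -DCMAKE_CXX_FLAGS="-Oz" \
      /path/to/my-project
    ninja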

yundiqian commented on May 2, 2024

Hi @yundiqian, I have migrated the demo project to the chrome/v8 project and got a 5% reduction in binary size. I want to know if I need to regenerate the saved model or can reuse the exact one generated for Fuchsia, etc.?

I'm a little confused; to be clear, which model caused the 5% size reduction, and on which binary?

Emm... I have tried three projects with ml-compiler-opt and got a 7% size reduction on the Fuchsia demo, 5% on the chrome/v8 build (trained 100*2000 out of time considerations), and -2% on my personal project (-.-).
I am retraining the third model because I trained it with -flto last time.
Plus, I am migrating the model into an Android CMake project whose stripped binary size is 1.7 MB or so.

Got it, so it's 3 projects instead of 2 projects :) Is the "Android CMake project whose stripped binary size is 1.7 MB or so" a 4th project, different from your personal project?

In addition to retraining without -flto, you can also try our model included in llvm --- this is a model that we found generalizable across SPEC, so probably generalizable to your project as well.
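
As a rough sketch of what "try our model included in llvm" can look like on the project side (the -mllvm -enable-ml-inliner=release flag and the TENSORFLOW_AOT_PATH cmake option are the ones mentioned later in this thread; file names are placeholders):

    # Compile with the ML inliner that is embedded in clang when clang itself was
    # configured with TENSORFLOW_AOT_PATH; keep -Oz and drop -flto.
    clang++ -Oz -mllvm -enable-ml-inliner=release -c foo.cpp -o foo.o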

mtrofin commented on May 2, 2024

I see now - thanks!

(fwiw - LLVM_ENABLE_LTO can be enabled for clang - just no -flto for your project)
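
A hedged sketch of that distinction (values and paths are placeholders; LLVM_ENABLE_LTO only affects how clang itself is built):

    # LTO for the clang build itself is fine:
    cmake -G Ninja -DLLVM_ENABLE_PROJECTS=clang -DLLVM_ENABLE_LTO=Thin /path/to/llvm
    # ...but the project compiled with that clang should keep -Oz and omit -flto
    # from its own CFLAGS/CXXFLAGS.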

yundiqian commented on May 2, 2024

hmm...interesting, we need to look into what happens during training to debug.

Can you share your log file during training with tensorboard.dev following the instructions here: https://tensorboard.dev/#get-started? (basically running two command lines)

When running "tensorboard dev upload --logdir logs..." , set the logdir flag to be the root_dir flag you use when running train_locally.py
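
For example (the directory is a placeholder; the point is that the same path is passed as --root_dir when training and as --logdir when uploading):

    # Training writes TensorBoard event files under root_dir (other flags omitted):
    python3 train_locally.py --root_dir=/tmp/inliner_run ...
    # Upload those logs so they can be shared:
    tensorboard dev upload --logdir=/tmp/inliner_run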

Colibrow commented on May 2, 2024

After testing, the original saved model could not be reused, so I'm closing this!

mtrofin commented on May 2, 2024

When you say "the original saved model could not be reused", do you mean you could not build the compiler with it embedded, or that its performance wasn't as good as that of the model you trained on chrome/v8?

Colibrow commented on May 2, 2024

When you say "the original saved model could not be reused", do you mean you could not build the compiler with it embedded, or that its performance wasn't as good as that of the model you trained on chrome/v8?

The compiler builds okay, but the binary size is bigger than the one that doesn't use the model, so I am training a new model for my personal project.

Colibrow commented on May 2, 2024

I see, that is possible: Fuchsia code may be quite different from the v8 code, so the model trained on Fuchsia does not work well on v8.

@yundiqian @mtrofin
Sadly, I found that the model trained specifically for the project didn't work, and the .so size was bigger than the one built without the trained model. Is there any way to find which compile flag influences the result? Do I need to turn off the -flto flag?

mtrofin commented on May 2, 2024

We don't support lto currently.

The model included with llvm is a reasonable reference, but we didn't use an overly comprehensive codebase when we trained it; that's why Fuchsia, for example, builds their own, which holds up well over time (as their codebase and as the compiler evolve).

Colibrow commented on May 2, 2024

We don't support lto currently.

The model included with llvm is a reasonable reference, but we didn't use an overly comprehensive codebase when we trained it; that's why Fuchsia, for example, builds their own, which holds up well over time (as their codebase and as the compiler evolve).

So do I need to regenerate the model with an llvm build that has LTO disabled?

Colibrow commented on May 2, 2024

I'll try it. Thanks!

yundiqian commented on May 2, 2024

Hi @yundiqian, I have migrated the demo project to the chrome/v8 project and got a 5% reduction in binary size. I want to know if I need to regenerate the saved model or can reuse the exact one generated for Fuchsia, etc.?

I'm a little confused; to be clear, which model caused the 5% size reduction, and on which binary?

Colibrow commented on May 2, 2024

Hi @yundiqian, I have migrated the demo project to the chrome/v8 project and got a 5% reduction in binary size. I want to know if I need to regenerate the saved model or can reuse the exact one generated for Fuchsia, etc.?

I'm a little confused; to be clear, which model caused the 5% size reduction, and on which binary?

Emm... I have tried three projects with ml-compiler-opt and got a 7% size reduction on the Fuchsia demo, 5% on the chrome/v8 build (trained 100*2000 out of time considerations), and -2% on my personal project (-.-).
I am retraining the third model because I trained it with -flto last time.
Plus, I am migrating the model into an Android CMake project whose stripped binary size is 1.7 MB or so.

Colibrow commented on May 2, 2024

Okay

In addition to retraining without -flto, you can also try our model included in llvm --- this is a model that we found generalizable across SPEC, so probably generalizable to your project as well.

I will try it ASAP. So many thanks~

Colibrow commented on May 2, 2024

In addition to retraining without -flto, you can also try our model included in llvm --- this is a model that we found generalizable across SPEC, so probably generalizable to your project as well.

Unfortunately, the size after training is bigger than the original one (which applies -flto -faddrsig / -flto -Wl,-z,norelro,-z,lazy,--icf=all), by about 3% or so.

mtrofin commented on May 2, 2024

To make sure I understand: you trained a model on your project (without lto, but with -Oz), and then built with that model (also without lto, and with -Oz)?

How does that size compare to all other options being the same, except building with the default heuristic?
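
A minimal sketch of such an A/B comparison, assuming two builds that differ only in whether -mllvm -enable-ml-inliner=release is passed (file names are placeholders):

    # Compare stripped sizes to keep symbol-table differences out of the numbers.
    llvm-strip -o libfoo_default.stripped libfoo_default.so
    llvm-strip -o libfoo_mlgo.stripped libfoo_mlgo.so
    ls -l libfoo_default.stripped libfoo_mlgo.stripped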

Colibrow commented on May 2, 2024

Here are the approaches:

  1. Build the project normally with nothing changed: the binary is 1705 kB.
  2. Remove -flto, with the other flags unchanged: the binary is 1755 kB.
  3. Build llvm with the latest model in llvm-project, with LLVM_ENABLE_LTO disabled and TENSORFLOW_AOT_PATH set;
     then remove -flto and add -mllvm -enable-ml-inliner=release: the binary is 1823 kB.
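
For reference, a sketch of what step 3 corresponds to on the llvm side (paths are placeholders; the cmake options are the ones named above):

    # Build clang with the bundled inliner model AOT-compiled in, and without LTO:
    cmake -G Ninja /path/to/llvm-project/llvm \
      -DCMAKE_BUILD_TYPE=Release \
      -DLLVM_ENABLE_PROJECTS=clang \
      -DLLVM_ENABLE_LTO=OFF \
      -DTENSORFLOW_AOT_PATH=/path/to/site-packages/tensorflow
    ninja clang
    # Then rebuild the project: drop -flto, add -mllvm -enable-ml-inliner=release.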

Colibrow commented on May 2, 2024

I haven't built the project-specific model yet because it needs a lot of time to train; when it's done, I'll post the result here~

Colibrow commented on May 2, 2024

FYI, I've tested my personal project twice, with and without the SPEC model, and found that the project-specific model is better than the SPEC one but still worse than the original.
Here is some data (sizes in bytes):
original (with -flto, no model): 1712312
without -flto: 1763256
without -flto, with the project-specific model (enable-ml-inliner): 1757576
with -flto and the project-specific model: 1716344

Colibrow commented on May 2, 2024

hmm...interesting, we need to look into what happens during training to debug.

Can you share your log file during training with tensorboard.dev following the instructions here: https://tensorboard.dev/#get-started? (basically running two command lines)

When running "tensorboard dev upload --logdir logs..." , set the logdir flag to be the root_dir flag you use when running train_locally.py

Okay, I'll try it.

Colibrow commented on May 2, 2024

I've also tried the cronet project and the reduction is also not obvious... I suspect my training process might be wrong?
Here are the details of how I applied the model: https://gist.github.com/Colibrow/9d2b31bc7eff127cfe74c807fce86451
And I found that using -flto alone may reduce the size more than applying the trained model by itself... I will post the log file later~
