Comments (13)
It could be the new format of perf
output. What is the perf
version that your are running and on what hardware? Ideally I'd like to see the output of perf script -F pid,brstack ...
, or at least at a part of it that was causing the trouble.
Also, our diagnostics was broken. Could you please pull the latest changes and re-run the command?
from bolt.
./perf --version
perf version 3.10.0-693.5.2.el7.x86_64.debug
The hardware is Intel Xeon Phi 7250. The hardware i am running perf2bolt, though is Intel(R) Xeon(R) CPU E5-2660 v2 @ 2.20GHz, same kernel and OS version, Centos 7.
Will respond your other questions later.
Thanks.
from bolt.
With the fixed diagnostics it looks like this:
Error reading bolt data input file: line 1, column 43: expected single char for mispred bit
Found: -
PERF2BOLT: Failed to parse samples
from bolt.
sudo perf script -f -F pid,brstack
First few lines of output:
7466 0xffffffff816ac32c/0x7f2c9cf100fe/-/-/-/0
7466 0xffffffff816ac32c/0x7f2c9cf100fe/-/-/-/0 0xffffffff816ac32c/0x7f2c9cf100fe/-/-/-/0
7466 0xffffffff816ac32c/0x7f2c9cf100fe/-/-/-/0 0xffffffff816ac32c/0x7f2c9cf100fe/-/-/-/0 0xffffffff816ac32c/0x7f2c9cf100fe/-/-/-/0
7466 0xffffffff816ac32c/0x7f2c9cf100fe/-/-/-/0 0xffffffff816ac32c/0x7f2c9cf100fe/-/-/-/0 0xffffffff816ac32c/0x7f2c9cf100fe/-/-/-/0 0xffffffff816ac32c/0x7f2c9cf100fe/-/-/-/0
from bolt.
For the perf
issue it's an easy fix, assuming that otherwise the output format is the same. If you can upgrade to perf
version 4.5
or later it might be easier to guarantee that. The more serious issue I see is with __intel_avx_rep_memcpy
and __intel_new_memcpy
functions referenced in the log. We'll need to add a support for jump tables / data in code, otherwise we cannot process them in relocation mode. The workaround is to use -relocs=0
, and it is not good for performance.
from bolt.
Grabbed perf from kernel 4.5:
by git checkout v4.5 and compiled it.
$ perf --version
perf version 4.5.gb562e4
Collected traces :
./perf record -e cycles:u -j any,u -o perf.data -- (...)
Then on a different machine after copying the perf.data:
sudo perf script -F pid,brstack -f
172359 0xffffffff816ac32c/0x7fc9b763c0fe/-/-/-/0
172359 0xffffffff816ac32c/0x7fc9b763c0fe/-/-/-/0 0xffffffff816ac32c/0x7fc9b763c0fe/-/-/-/0
172359 0xffffffff816ac32c/0x7fc9b763c0fe/-/-/-/0 0xffffffff816ac32c/0x7fc9b763c0fe/-/-/-/0 0xffffffff816ac32c/0x7fc9b763c0fe/-/-/-/0
I have also tried perf version 4.14.rc5.g7c584a with the same results.
from bolt.
I guess the problem here is not in perf version but possibly perf on KNL not supporting the correct hw event.
From:
https://gitlab.imag.fr/kaunetem/linux-kaunetem/commit/dc323ce8e72d6d1beb9af9bbd29c4d55ce3d7fb0
M/P/-: M=branch target mispredicted or branch direction was mispredicted, P=target predicted or direction predicted, -=not supported
I will have a look how perf gets it.
from bolt.
Yep this one is a KNL issue. On other CPU i got:
0xffffffff816ac32c/0x7f7b322365a0/P/-/-/0
from bolt.
Interesting. We only use misprediction bit for ICP optimization, so it should be safe to ignore it. I'll add a patch. Thank you for the investigation.
from bolt.
Great, thanks! So how about the intel memcpy (and others) replacements?
Is there anything i can do to help? Should a new issue be raised?
from bolt.
Is there any way to force intel compiler to use libc functions? It justifies the new issue in any case.
from bolt.
Looks good, closing:
PERF2BOLT: Waiting for perf tasks collection to finish...
PERF2BOLT: Parsing perf-script tasks output
PERF2BOLT: Input binary is associated with 1 PID(s)
PERF2BOLT: Waiting for perf events collection to finish...
PERF2BOLT: Aggregating branch events...
PERF2BOLT-WARNING: misprediction bit is missing in profile
from bolt.
Is there any way to force intel compiler to use libc functions?
-ffreestanding seems to cut it.
https://software.intel.com/en-us/forums/intel-c-compiler/topic/306063
from bolt.
Related Issues (20)
- A dump about function cannot be properly disassembled when use -use-old-text HOT 2
- perf2bolt: crashes with assertion. HOT 5
- A NullPtrException after BOLT on libart.so HOT 2
- BOLT/LLVM? does not preserve prefixes on conditional branches HOT 2
- Assert error in resolveAArch64Relocation HOT 15
- ELF32 support HOT 1
- BOLT only failed with --hugify HOT 13
- Why skip-function when it jump to itself? HOT 7
- Why Is registerName Executed Twice for a Symbol? HOT 1
- How do I run BOLT on a benchmark of HHVM? HOT 1
- A Problem About .got Table Updates HOT 4
- Promote BOLT to the chromium team HOT 11
- A Problem about IsSimple be used in disassemble
- Any ideas to support riscv architecture? HOT 5
- clang14 build with -update-debug-sections rewrite debug info core HOT 14
- A fatal error about -update-debug-sections HOT 3
- Any suggest to couple two functions? HOT 11
- How to use --hugify option? HOT 20
- Failed to BOLT HOT 2
- [C++] Exceptions catch leads to Segmentation fault
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from bolt.