Comments (4)
After sprinkling in a bunch of instrument attributes, spans, and a flamegraph layer; I can confirm that all of our time is spent in deserializing at mountpoint_s3::data_cache::disk_data_cache.
from mountpoint-s3.
Based on looking at metrics in the debug logs it looks like the cache is "working", blocks are pulled and read from cache. The performance is not close to what I would expect for reading from disk. The same 10G file copied to my /mnt/raid0 disk comes off as such...
# time cat /mnt/raid0/file_1 > /dev/null
real 0m1.150s
user 0m0.000s
sys 0m1.150s
Next steps I'm going to try adding some flamegraphs to the tracing and see where this peaking.
from mountpoint-s3.
Thanks for reporting this. In [1], we are introducing cache benchmarks. With this in place, we want to explore improvements by experimenting with changing the serialization library, for example.
[1] #783
from mountpoint-s3.
I should mention that the tracing changes I made are in my fork here https://github.com/raykrueger/mountpoint-s3/tree/flamegraphs
I had no intention of sending a PR for the changes as they were pretty heavy handed.
from mountpoint-s3.
Related Issues (20)
- Improvements to error reporting (RFC)
- Add the expected behaviours of Mountpoint in the troubleshooting page that customers face HOT 1
- Duplication of file and directory of same name in S3 Express Zone One bucket using Mountpoint
- Build is not reproducible HOT 1
- Double `touch` errors HOT 2
- Include creation of files for benchmarking Mountpoint in benchmarking script HOT 1
- token is not refreshed when running in k8s pod connected to serviceaccount HOT 8
- Unable to Write HOT 4
- Files unexpectedly released leading to `Bad file descriptor` HOT 5
- Disabling data cache isn't mentioned in the documentation HOT 3
- Unable to mount non-aws storage due to "Host name was invalid for dns resolution" HOT 2
- Clarify the expected behavior of the `--metadata-ttl` option set to zero HOT 3
- Separate part sizes for GET and PUT HOT 3
- `--metadata-ttl` option requires `--cache` directory HOT 1
- Support for BottlerocketOS HOT 1
- `stat` call for `dir1/dir2/dir3/file` triggers many List and Head requests HOT 1
- error in issue template HOT 2
- ls: cannot access /mnt/s3: Connection refused HOT 2
- `stat` call for path triggers two requests (one List and one Head) with enabled cache HOT 1
- Input/output error during reading deleted object with cached metadata HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mountpoint-s3.