Comments (7)
Thanks for the updates. For your workload, you can try spawning threads to read multiple files at the same time. That should get you close to the instance bandwidth. We will also update documentation on our side.
from mountpoint-s3.
Hey @karthik1900, that throughput seems really low. Can you share more about how you read data from the mounted directory? Based on our benchmark results (https://awslabs.github.io/mountpoint-s3/dev/bench/), increasing number of concurrent reads and opening a file in O_DIRECT
mode might help.
from mountpoint-s3.
Hey @monthonk, thanks for the quick response
I am using a simple python open file
with open("myfile", "rb") as f:
data = f.read()
Will try with the O_DIRECT mode and update here
from mountpoint-s3.
When I use fio to to read the file, I am getting about 22 Gb/s, I guess the issue is with my read
fio --name=read --rw=read --bs=4m --direct=1 --filename="myfile"
from mountpoint-s3.
Hey @monthonk
Is there a way where we can set a configuration which will allow multiple threads to read one large file (as s3 allows reading in chunks)
from mountpoint-s3.
Hey, we already do that in the background. In mountpoint, a file system read request could spawn multiple GetObject requests and download object parts in parallel. The bottleneck is at the FUSE interface when you are reading the file. This PR https://github.com/awslabs/mountpoint-s3/pull/834/files explains a bit about that.
from mountpoint-s3.
Hi! We merged that particular documentation change (#834), so we now have a section in troubleshooting slower throughput than expected: https://github.com/awslabs/mountpoint-s3/blob/main/doc/TROUBLESHOOTING.md#slower-throughput-than-expected
If there's any more questions here, please let us know - or we can close the issue. Thanks!
from mountpoint-s3.
Related Issues (20)
- Reads from a file being written can succeed from page cache
- Kubernetes pods stuck with multiple S3 mount points HOT 1
- Automatically create unique cache directories HOT 1
- Support providing `AES256` SSE type with each PutObject request HOT 1
- mount-s3 stopped working without explanation HOT 4
- Making the S3 filesystem available to the host from inside a docker container HOT 4
- Add --bind option to specify address when running on a host with multiple NICs HOT 2
- Error splicing file: File too large HOT 1
- Missing docs enumerating options HOT 3
- `ls` fails every second run | `rewinddir` support HOT 11
- Operations hang indefinitely after opening too many files HOT 2
- Ability to configure maximum retry count
- Metadata Caching in case of not existing file HOT 12
- Need a function to monitoring the total usage of S3 mountpoint or S3 PVC HOT 1
- Slurm(srun) unable to start mountpoint HOT 11
- Support wasabi s3 storage HOT 1
- Support ovhcloud s3 high performance object storage HOT 1
- Support for writing to Google Cloud Storage buckets HOT 12
- I think there should be a cool icon for this product HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mountpoint-s3.