Comments (6)
Hi @EntilZha thanks for your interest in our work! Which URL exactly is slow to download (so we can better diagnose)? You're trying to download from Seattle?
from pyserini.
Hi, I am in Seattle :), but the download should originate from 208.53.44.121 (if `curl ifconfig.me` is accurate), so Utah. Downloads to Seattle seem to run at normal speed, FWIW.
EDIT: One example is the file at https://rgw.cs.uwaterloo.ca/pyserini/indexes/dindex-msmarco-passage-tct_colbert-v2-bf-20210608-5f341b.tar.gz
Hrm... odd... we've never had issues with this host before. @MXueguang @jacklin64 @alexlimh have you had issues from Seattle before?
Weird, I haven't hit this issue before. @EntilZha are you looking for just a specific index? We can upload it to Dropbox for you.
Sorry for the delay (vacation last week). It seems that if I am on local Seattle internet, I can download it at normal speed, so I could then upload it from my local computer. It's when I download from the cluster with the Utah IP that I run into the slower connection. I'm not per se looking for a specific index; in this case I was setting up a pipeline, so I was testing with the smallest index I could find before trying larger ones. I'll see if I can debug the connection a little.
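For anyone comparing the two network paths, a rough way to put numbers on "slow" is to time a partial download of the same file from each machine. The sketch below is only an illustration, not part of Pyserini: the helper names `throughput_mib_per_s` and `sample_download`, the 50 MiB sample size, and the timeout are all assumptions chosen for this example.

```python
import time
import urllib.request


def throughput_mib_per_s(num_bytes: int, seconds: float) -> float:
    """Convert a byte count and an elapsed time into MiB per second."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return num_bytes / (1024 * 1024) / seconds


def sample_download(url: str,
                    max_bytes: int = 50 * 1024 * 1024,
                    chunk_size: int = 1024 * 1024,
                    timeout: float = 30.0) -> tuple[int, float]:
    """Read at most max_bytes from url and return (bytes_received, elapsed_seconds).

    Hypothetical helper for this thread: it only samples the start of the
    file, so a multi-GB index does not need to be fetched in full.
    """
    received = 0
    start = time.monotonic()
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        while received < max_bytes:
            chunk = resp.read(min(chunk_size, max_bytes - received))
            if not chunk:  # server closed the stream early
                break
            received += len(chunk)
    return received, time.monotonic() - start


# Example usage (performs a real network request, so it is left commented out):
# url = ("https://rgw.cs.uwaterloo.ca/pyserini/indexes/"
#        "dindex-msmarco-passage-tct_colbert-v2-bf-20210608-5f341b.tar.gz")
# n, secs = sample_download(url)
# print(f"{n} bytes in {secs:.1f}s = {throughput_mib_per_s(n, secs):.2f} MiB/s")
```

Running the same sample from the Seattle machine and from the cluster would give two comparable figures to report back here.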
Seeing no follow up, closing issue!
Related Issues (20)
- SLIM regressions on MS MARCO v2 passage HOT 1
- DPR encoder seemingly missing attention mask HOT 3
- Is it possible to seperate bm25 search module in pyserini?
- ODQA NQ regression error HOT 1
- Unrecognized index name wikipedia-dpr-100w.dpr-multi
- Will different searcher and document_searcher affect the search results?
- Bug introduced by #1622 max_length in init_query_encoder HOT 1
- Normalize embeddings when using a custom dense encoder? HOT 3
- How to add stop words when building BM25 index?
- duplicate query encoder code
- Feature request: docker build for portability HOT 3
- test cases time out
- BM25 batch search with multi threads error: java.lang.OutOfMemoryError: Java heap space HOT 1
- Incorporate SPLADE++ ED BEIR regressions HOT 2
- How to build collections using msmarco and beir HOT 2
- How to get raw content HOT 4
- In Splade example for MS Marco evaluation why index 8.8M train passages and evaluate wiht 6980 queries from dev ?
- trec_eval error HOT 6
- LuceneSearcher + multiprocessing problem
- Upgrading to Pyserini 0.24 means `.raw` option not available. HOT 1