Comments (4)
🤦🏽♀️ I apparently over-estimated my ability to copy & paste properly. It turns out I had to grant Google Colab permission to paste the API key from my clipboard, and that seems to have been the problem. Thank you so much for the support on a rather silly issue. Thank you for your contributions to this work, and I'm looking forward to working with the dataset!
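A paste that silently fails like this can be caught before `dataset.compile()` with a small sanity check on the pasted string. This is only a sketch: `clean_bearer_token` is a hypothetical helper, not part of mumin, and it checks the shape of the string only, not whether Twitter accepts the token.

```python
def clean_bearer_token(raw: str) -> str:
    """Hypothetical helper: tidy a pasted token and reject obviously bad input.

    It does NOT contact the Twitter API, so a token that passes this check
    can still be rejected as invalid or expired.
    """
    # Drop surrounding whitespace/newlines and stray quote characters
    # that often come along with a copy & paste.
    token = raw.strip().strip("\"'")

    # An empty string is what you get when the clipboard paste was blocked.
    if not token:
        raise ValueError("Bearer token is empty; the paste may have failed.")

    # Whitespace inside the token suggests a broken or partial paste.
    if any(ch.isspace() for ch in token):
        raise ValueError("Bearer token contains whitespace; re-copy it.")

    return token


if __name__ == "__main__":
    # Placeholder value for illustration only, not a real token.
    print(clean_bearer_token("  'AAAAplaceholder123'\n"))
```

Running the check right after pasting turns an empty or mangled key into an immediate, readable error rather than a failure deep inside the merge step.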
from mumin-build.
Hi @ramirezmichelle, and thanks for your input! Can you try to install the master version in the notebook (in the "Installation" section) and see if that fixes your issue?
from mumin-build.
Yep - I re-ran the installation with `!pip install mumin[all]` and `!pip install git+https://github.com/MuMiN-dataset/mumin-build@dev` instead of the notebook's `!pip install mumin[all]==1.6.2 torchmetrics==0.7.2 --quiet` command, but it still throws the same error. I also realized I left out some details in the earlier version of the error message, so here is the full error that is thrown:
INFO:mumin.dataset:Loading dataset
INFO:mumin.dataset:Shrinking dataset
INFO:mumin.dataset:Rehydrating tweet nodes
Rehydrating: 0% 0/5261 [00:09<?, ?it/s]
---------------------------------------------------------------------------
KeyError Traceback (most recent call last)
<ipython-input-7-da99dd72c67a> in <module>
----> 1 dataset.compile()
6 frames
/usr/local/lib/python3.7/dist-packages/mumin/dataset.py in compile(self, overwrite)
204
205 # Rehydrate the tweets
--> 206 self._rehydrate(node_type='tweet')
207 self._rehydrate(node_type='reply')
208
/usr/local/lib/python3.7/dist-packages/mumin/dataset.py in _rehydrate(self, node_type)
509 .merge(prehydration_df,
510 on='tweet_id',
--> 511 how='outer')
512 .reset_index(drop=True))
513
/usr/local/lib/python3.7/dist-packages/pandas/core/frame.py in merge(self, right, how, on, left_on, right_on, left_index, right_index, sort, suffixes, copy, indicator, validate)
9201 copy=copy,
9202 indicator=indicator,
-> 9203 validate=validate,
9204 )
9205
/usr/local/lib/python3.7/dist-packages/pandas/core/reshape/merge.py in merge(left, right, how, on, left_on, right_on, left_index, right_index, sort, suffixes, copy, indicator, validate)
117 copy=copy,
118 indicator=indicator,
--> 119 validate=validate,
120 )
121 return op.get_result()
/usr/local/lib/python3.7/dist-packages/pandas/core/reshape/merge.py in __init__(self, left, right, how, on, left_on, right_on, axis, left_index, right_index, sort, suffixes, copy, indicator, validate)
697 self.right_join_keys,
698 self.join_names,
--> 699 ) = self._get_merge_keys()
700
701 # validate the merge keys dtypes. We may need to coerce
/usr/local/lib/python3.7/dist-packages/pandas/core/reshape/merge.py in _get_merge_keys(self)
1107 right_keys.append(rk)
1108 if lk is not None:
-> 1109 left_keys.append(left._get_label_or_level_values(lk))
1110 join_names.append(lk)
1111 else:
/usr/local/lib/python3.7/dist-packages/pandas/core/generic.py in _get_label_or_level_values(self, key, axis)
1777 values = self.axes[axis].get_level_values(key)._values
1778 else:
-> 1779 raise KeyError(key)
1780
1781 # Check for duplicates
KeyError: 'tweet_id'
from mumin-build.
Hmm, I tried to replicate this: I can reproduce the error when the bearer token is incorrect, but everything works as expected when the bearer token is correct.
Can you verify that the bearer token you are using is correct and still valid?
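To illustrate why a rejected bearer token can surface as `KeyError: 'tweet_id'` rather than as an authentication error, here is a minimal pandas sketch. It assumes, without confirmation from the mumin source, that rehydration yields a frame with no columns when the token is rejected. `rehydrated_df` and `prehydration_df` are hypothetical stand-ins for the two sides of the merge shown in the traceback.

```python
import pandas as pd

# Hypothetical stand-in for the IDs known before rehydration.
prehydration_df = pd.DataFrame({"tweet_id": [1, 2, 3]})

# ASSUMPTION: with a rejected token nothing comes back, so the
# rehydrated frame has no rows and, crucially, no 'tweet_id' column.
rehydrated_df = pd.DataFrame()


def try_merge(left: pd.DataFrame, right: pd.DataFrame):
    """Return the missing merge key if the merge fails, else None."""
    try:
        left.merge(right, on="tweet_id", how="outer")
    except KeyError as err:
        # pandas raises KeyError(key) when `on` names a column
        # that one side of the merge does not have.
        return err.args[0]
    return None


missing_key = try_merge(rehydrated_df, prehydration_df)
print(missing_key)
```

Under that assumption, the merge fails on the missing column before any authentication problem is reported, which would explain why the traceback points at pandas and not at the token.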
from mumin-build.