Comments (9)
Numba does not yet support the RPi3 (armv7 using the typical raspbian install), although we have a PR to add support here:
As noted in the PR discussion, not all of our unit tests pass on ARMv7 for some reason.
from fastparquet.
I also have newer pandas for ARM in my Anaconda Cloud channel:
https://anaconda.org/seibert/pandas
from fastparquet.
I believe that all necessary packages run on ARM. @seibert would know more.
from fastparquet.
snappy is not a hard dependency, but without it, you can't use that compression, of course.
from fastparquet.
If you get fastparquet working on a raspberryPi, I for one definitely want to know your use case!
from fastparquet.
Installed Miniconda on my pi3...
then conda install -c conda-forge fastparquet...no armv6 packet okay
with pip install git+https://github.com/dask/fastparquet i get a issue about the llvmlite modul...
conda install llvmlite solve this problem.
....more to come
from fastparquet.
@rddaz2013 , any success? Would be interested to know your use case.
from fastparquet.
- Fresh PI-3
- Install Mini-conda + requirements
pi@raspberrypi:~/fastparquet/fastparquet/test $ python
Python 3.4.3 |Continuum Analytics, Inc.| (default, Aug 21 2015, 00:53:08)
[GCC 4.6.3] on linux
-
Modify fastparquet to work without Numba...
-
Problem the mini-conda pandas for arm is Version 0.16.2
> pi@raspberrypi:~/fastparquet/fastparquet/test $ python
Python 3.4.3 |Continuum Analytics, Inc.| (default, Aug 21 2015, 00:53:08)
[GCC 4.6.3] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import pandas as pd
>>> pd.__version__
'0.16.2'
with this i get an error running test_api.py
pi@raspberrypi:~/fastparquet/fastparquet/test $ python test_api.py
Traceback (most recent call last):
File "test_api.py", line 8, in <module>
from fastparquet.util import tempdir
File "/home/pi/miniconda3/lib/python3.4/site-packages/fastparquet-0.0.3-py3.4.egg/fastparquet/__init__.py", line 16, in <module>
from .writer import write
File "/home/pi/miniconda3/lib/python3.4/site-packages/fastparquet-0.0.3-py3.4.egg/fastparquet/writer.py", line 21, in <module>
from . import encoding, api
File "/home/pi/miniconda3/lib/python3.4/site-packages/fastparquet-0.0.3-py3.4.egg/fastparquet/api.py", line 18, in <module>
from . import core, schema, converted_types, encoding, writer, dataframe
File "/home/pi/miniconda3/lib/python3.4/site-packages/fastparquet-0.0.3-py3.4.egg/fastparquet/dataframe.py", line 6, in <module>
from pandas.core.index import RangeIndex, Index
ImportError: cannot import name 'RangeIndex'
RangeIndex -> pandas v 0.18+ ??
from fastparquet.
" I for one definitely want to know your use case! "
- Proof of Concept for my 3 Working system..(Win, Linux and the PI)
- Python + Scipy + Numpy + matplotlib ( today only python2.10 but work for python3 in progress)
- temperature/pressure records with time intervall from 1/10 ms to minutes + duration of measurement of minutes or month.
- raspberry pi = independent data collector for slow data and event logging.
- At the moment the source data is..first stage TXT! (~ up to 200 MB) second stage Numpy.arrays... works for me, but is not a standardized data format.
Not really BIG-Data, but too much for Excel .-)
from fastparquet.
Related Issues (20)
- fastparquet encoding issue. HOT 20
- BUG: reading boolean column with RLE encoding gives wrong values HOT 4
- fastparquet cannot read a categorical column that contains NaNs only HOT 2
- to_pandas(): cramjam.DecompressionError: snappy: output buffer (size = 262144) is smaller than required (size = 1048576) HOT 1
- BUG: dataframe.empty with non-nano pd.DatetimeTZDtype HOT 2
- a python-3.12 windows wheel HOT 13
- Some `fastparquet`-related tests are failing on Python 3.10 HOT 10
- Regression due to `_from_sequence` HOT 1
- attrs persistance for Pandas HOT 1
- Nullable types for 1 row vs multiple rows HOT 3
- update_file_custom_metadata error when file has no properties.
- schema evolution when writing the row groups does not work HOT 4
- Bug loading parquet files with timezone information HOT 6
- When changing to a larger dtype, its size must be a advisor of the total size in bytes of the last axis of the array HOT 6
- PyArrow will become a required dependency with pandas 3.0
- Option to not close() after write() when writing to buffer HOT 3
- Support zoneinfo.ZoneInfo timezones
- Loading List of List of Strings leads to nans HOT 6
- Upcoming pandas (>2.2.0) raises "read-only" errors HOT 3
- Categorical dtype not preserved with fastparquet-write, pyarrow-read HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from fastparquet.