Comments (4)
@alexott - thanks for reporting this & sorry for the delayed response.
I wasn't able to replicate your issue (I successfully attached chispa v0.6.0 to a Databricks cluster both via PyPI and by manually attaching the wheel), but I'm a Python n00b and I'm sure your point is valid.
In the Scala world, we add the Spark dependency like this:
libraryDependencies += "org.apache.spark" %% "spark-sql" % "2.4.4" % "provided"
The "provided" part is what makes Spark a "soft" dependency: the library compiles against Spark, but the runtime environment is expected to supply it.
I updated PySpark to be a "dev-dependency" rather than a regular dependency:
[tool.poetry.dependencies]
python = ">2.7"
[tool.poetry.dev-dependencies]
pytest = "3.2.2"
pytest_describe = "^1.0.0"
pyspark = ">2.0.0"
I published chispa v0.7.0. Can you please try it out and let me know if it fixes your issue? Thanks!
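With PySpark moved out of the regular dependencies, installing the package no longer pulls PySpark in, so the environment (a Databricks cluster, for example) has to supply it. A minimal sketch of how code could check for that at runtime is below. The helper names `module_available` and `require_pyspark` are hypothetical and are not part of chispa; only the standard library's `importlib.util.find_spec` is used.

```python
import importlib.util


def module_available(name: str) -> bool:
    """Return True if a top-level module can be located without importing it."""
    return importlib.util.find_spec(name) is not None


def require_pyspark() -> None:
    """Raise a readable error when the environment does not supply PySpark.

    Hypothetical helper for illustration; chispa itself may not do this.
    """
    if not module_available("pyspark"):
        raise ImportError(
            "pyspark is not installed. It is treated as a soft dependency, "
            "so install it yourself or run on a cluster that provides it."
        )
```

Calling `require_pyspark()` before any Spark-specific work would turn a confusing import failure deep in the call stack into a single clear message.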
from chispa.
Thank you very much, Matthew! I'll check on Monday, when I get to my work laptop...
Thank you! I just checked, and it works just fine now.
@alexott - thanks for confirming!
If you ever have any additional recommendations for this library, just let me know, thanks!