Comments (6)
Thank you for reporting this @ametwalli1
When the hostname is not present, the correct syntax for xdfs compatible filesystem should be xdfs:///.
Could try using dbfs:///mnt/datasets_area/dbfs:/mnt/pending_area
instead and let me know how this work for you.
from starlake.
Thank you very much @hayssams, using dbfs:///mnt/pending_area
as SL_AREA_PENDING is working fine !
However, I have an other trouble here.
When doing the import with this setting, starlake pushes all the source files without filtering by domain. This was done by default in the DatasetArea.scala path function that would add the /$domain variable at the end of the path. But it doesn't do it anymore.
So in my situation if I wanted to create the domain directories in my pending area, I thought using dbfs:///mnt/pending_area/$domain
would work in my cluster environment. It didn't change anything ($domain was ignored), so I tried dbfs:///mnt/pending_area/{domain}
instead and it created a {domain} directory.
Do you know how can I use variables in my cluster environment ?
from starlake.
Could you try {{domain}} instead and let me know
from starlake.
dbfs:///mnt/pending_area/{{domain}} creates a {{domain}} directory also.
The only way to use variables is done by using /$domain but I guess that you have some regex security check and it is ignored.
The better way to solved this would be to add the /$domain at the end of the area in DatasetArea.scala line 50.
Or maybe there is another way to introduce variables.
from starlake.
Can We setup a call 5:30 PM?
from starlake.
Sure ! Here is my email : [email protected]
from starlake.
Related Issues (17)
- [FEATURE] - Parallel Ingestion Mode
- [DOC] - Add JDBC Sink sample
- [DOC] - Example of ORC support
- [FEATURE] - Improve performance when loading a huge number of files in GROUPED mode HOT 6
- [FEATURE] - Add more fields in xlsx files to get around the excel sheet name limit and have more descriptive name HOT 1
- [BUG] - Can't use DML statements in autotask job HOT 5
- [BUG] - JQ not installed on user host
- [BUG] - Path error HOT 15
- [BUG] - Accepted row counting is incorrect in Audit table HOT 1
- [TU] - Enrich Unit test po HOT 1
- [FEATURE] - Support Schema on write
- [BUG] - Infer-schema has an input file error while running on databricks HOT 3
- [DOC] - Add documentation for all ENV VARS
- [BUG] - Unit Test for Absolute dataset area path HOT 1
- [BUG] - BigQuery - Missing `audit` table when using a custom sink name
- [DOC] - Databricks on Azure HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from starlake.