Comments (4)
- Currently, this project does not support multiple physical nodes, and I'm not sure how to implement that. The problem is that Hadoop uses SSH to communicate between nodes, which causes problems when the containers run on different physical nodes.
- Currently, this project runs HDFS inside the container, which means the data is deleted when the container is deleted. So, to use it in production, you need to keep the HDFS data on the host node using a Docker volume. In addition, you need to run multiple masters to ensure availability. If you solve these problems, I think it is OK to run Hadoop in containers for production. Of course, you should run more tests before using it.
from hadoop-cluster-docker.
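The Docker volume suggestion above could be sketched roughly as follows. This is only an illustration, not the project's actual startup script: the image name `kiwenlau/hadoop:1.0`, the in-container HDFS directory `/root/hdfs`, the network name `hadoop`, and the container and volume names are all assumptions that would need to match your own setup. The script only builds and prints the `docker` commands; it does not run them.

```python
import shlex
from typing import List


def hdfs_volume_run_args(
    name: str,
    volume: str,
    image: str = "kiwenlau/hadoop:1.0",  # assumed image name
    data_dir: str = "/root/hdfs",        # assumed HDFS data directory in the container
    network: str = "hadoop",             # assumed Docker network name
) -> List[str]:
    """Build a `docker run` command that mounts a named volume at the HDFS data dir."""
    return [
        "docker", "run", "-itd",
        "--network", network,
        "--name", name,
        "--hostname", name,
        "-v", f"{volume}:{data_dir}",  # data in the volume outlives the container
        image,
    ]


def cluster_commands(slaves: int) -> List[List[str]]:
    """Return volume-creation and run commands for one master and `slaves` slaves."""
    names = ["hadoop-master"] + [f"hadoop-slave{i}" for i in range(1, slaves + 1)]
    commands: List[List[str]] = []
    for node in names:
        volume = f"{node}-hdfs"  # hypothetical naming scheme: one volume per node
        commands.append(["docker", "volume", "create", volume])
        commands.append(hdfs_volume_run_args(node, volume))
    return commands


if __name__ == "__main__":
    # Print the commands so they can be reviewed before being run by hand.
    for command in cluster_commands(2):
        print(shlex.join(command))
```

With one named volume per node, removing and recreating a container with the same volume should leave the HDFS data in place, provided the mount path really is where HDFS stores its data in your image.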
Thanks @kiwenlau. Regarding your first point, I think setting up Docker Swarm will work; however, my team and I have yet to test this. I will update you when this is done.
We will also test Docker volumes and provide our feedback soon.
Thanks again.
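As a rough idea of what the Docker Swarm approach mentioned above might involve, the sketch below builds the commands for initializing a swarm, joining further hosts, and creating an attachable overlay network that containers on different hosts could share. Like the comment itself, this is untested against this project: the network name `hadoop` is an assumption, and the join token and manager address are placeholders you would take from the output of `docker swarm init`.

```python
from typing import List


def overlay_setup_commands(network: str = "hadoop") -> List[List[str]]:
    """Commands to run on the first host (the swarm manager)."""
    return [
        ["docker", "swarm", "init"],
        # --attachable lets standalone containers (started with `docker run`)
        # join the overlay network, not only swarm services.
        ["docker", "network", "create", "--driver", "overlay", "--attachable", network],
    ]


def join_command(token: str, manager_addr: str, port: int = 2377) -> List[str]:
    """Command to run on each additional host; token and address are placeholders."""
    return ["docker", "swarm", "join", "--token", token, f"{manager_addr}:{port}"]


if __name__ == "__main__":
    for command in overlay_setup_commands():
        print(" ".join(command))
    print(" ".join(join_command("<token>", "<manager-ip>")))
```

Whether Hadoop's SSH-based communication then works across hosts on that network is exactly what still needs to be tested.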
Any updates, guys? I would like to use HDFS on a multi-host cluster, i.e., with Docker running on each host machine and the hosts sharing data among themselves...
@kiwenlau I don't know why the NodeManager runs on the slaves only with your Docker code. If I use components of your code in my own Dockerfile, it doesn't work (the NodeManager is not running on the slaves)...
Related Issues (20)
- Is this project still maintained? The main maintainer doesn't seem very active HOT 1
- Cannot access the web interface HOT 1
- Cannot open files with gedit HOT 1
- master and docker HOT 2
- I'll never learn how to set up an environment, not in this lifetime HOT 1
- serf members bash: serf: command not found
- How to install other components such as kafka flume and spark
- There are not pip or pip3 in the docker...
- docker cp command
- sudo ./resize-cluster.sh 5 HOT 1
- How do master and slave log in to each other without a password when no public keys have been exchanged?
- start-container.sh error HOT 1
- ../
- Running start-container.sh: the output window appears and then disappears, and I don't see a successful start
- Problem executing start-container.sh
- cannot download file from web HOT 1
- Bad owner or permissions on /root/.ssh/config
- After startup, can the host machine access the container's Hadoop services via the container IP?
- Can this be upgraded to Hadoop 3.3+?
- library initialization failed - unable to allocate file descriptor table - out of memory./run-wordcount.sh: line 28: 258 Aborted (core dumped) hdfs dfs -cat output/part-r-00000