sebc's Issues
Install CM
- Configure the CM repo to install the latest 5.9 release on the second node listed before
  - List the command and output of `ls /etc/yum.repos.d` in `challenges/labs/2_cm.md`
  - Copy the `cloudera-manager.repo` file to `challenges/labs/2_cloudera-manager.repo.md`
- Use the `scm_prepare_database.sh` script to write your `db.properties` file; list the full command line in `2_cm.md`
- Start the Cloudera Manager server. Then in `challenges/labs/2_db.properties.md`:
  - Add the first line from your server log
  - Add the log line that contains the phrase "Started Jetty server"
- Copy your `db.properties` file to `challenges/labs/2_db.properties.md`
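
The database and log steps above can be sketched as follows. This is a minimal sketch, not the course's reference solution: the log path and the `/usr/share/cmf/schema/` script location are the usual defaults for a package-based CM 5.x install and are assumptions here, the helper name `cm_log_lines` is hypothetical, and the database host, user, and password are placeholders.

```shell
#!/usr/bin/env bash
# Assumed default log location for a package-based CM 5.x install; adjust if yours differs.
CM_LOG="${CM_LOG:-/var/log/cloudera-scm-server/cloudera-scm-server.log}"

# Writing db.properties (run once, as root, on the CM node). Shown as a comment
# because the host, user, and password are placeholders:
#   /usr/share/cmf/schema/scm_prepare_database.sh mysql -h <mysql-host> scm <scm-user> <scm-password>

# cm_log_lines (hypothetical helper): print the first line of the server log,
# then every line that contains the phrase "Started Jetty server".
cm_log_lines() {
  local log="$1"
  head -n 1 "$log"
  grep 'Started Jetty server' "$log"
}

# Only read the log if it exists, so the sketch is harmless on other hosts.
if [ -r "$CM_LOG" ]; then
  cm_log_lines "$CM_LOG"
fi
```

Redirecting the function's output into the lab file (for example `cm_log_lines "$CM_LOG" >> challenges/labs/2_db.properties.md`) keeps the captured lines exactly as the server wrote them.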
YARN/RM Lab
- Doing the Math
- Static Service Pools
- Tuning for YARN
CM Lab
- Monitoring Lab
- Create a Custom Dashboard
- Use the API
- Upgrade Cloudera Manager
Install CDH
- Install the latest CDH release; deploy Coreset services only
- Rename your cluster after your GitHub handle
- Create user directories in HDFS for `neymar` and `ronaldo`
- Add the following to `3_cm.md`:
  - Command and output for `hdfs dfs -ls /user`
  - The output from the CM API call `../api/v14/hosts`
- Log in to Hue and install the Hive sample data
  - Capture a Hue screen that lists the Hive tables to `challenges/labs/3_hue_hive.png`
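
The HDFS directory and API steps above might look like the sketch below. It prints the commands rather than running them, so they can be reviewed first. The helper names are hypothetical, the local `hdfs` superuser account and port 7180 (the CM default without TLS) are assumptions, and the host name and credentials are placeholders.

```shell
#!/usr/bin/env bash
# hdfs_user_dir_commands (hypothetical helper): print the commands that create
# and hand over a home directory in HDFS for each user named on the command line.
# Assumes the HDFS superuser is the local `hdfs` account.
hdfs_user_dir_commands() {
  local user
  for user in "$@"; do
    echo "sudo -u hdfs hdfs dfs -mkdir -p /user/${user}"
    echo "sudo -u hdfs hdfs dfs -chown ${user} /user/${user}"
  done
  echo "hdfs dfs -ls /user"
}

# cm_api_url (hypothetical helper): build the URL for a CM API v14 resource.
# Port 7180 is assumed; the host is a placeholder.
cm_api_url() {
  local host="$1" resource="$2"
  echo "http://${host}:7180/api/v14/${resource}"
}

hdfs_user_dir_commands neymar ronaldo
echo "curl -u <admin-user>:<password> $(cm_api_url cm-host.example.com hosts)"
```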
Storage labs
HDFS Lab tasks
- Replicate to another cluster
- Test HDFS performance
- Test HDFS Snapshot
- Enable HDFS HA
Challenges Setup
- Create the file `challenges/labs/0_setup.md` with:
  - List the cloud provider you are using (AWS, GCE, Azure, other)
  - List the Linux release you have chosen
  - Show that the disk space on each node is at least 30 GB
  - List the command and output for `yum repolist enabled`
- Add the following Linux accounts to all nodes
  - User `neymar` with a UID of `2010`
  - User `ronaldo` with a UID of `2016`
- Create the group `barca` and add `ronaldo` to it
- Create the group `merengues` and add `neymar` to it
- List the `/etc/passwd` entries for `neymar` and `ronaldo` in your setup file
- List the `/etc/group` entries for `barca` and `merengues` in your setup file
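
One way to carry out the account steps above on a single node is sketched here. The function only prints the commands, so they can be checked before being run as root. The helper name `account_setup_commands` is hypothetical, and `df -h` is an assumed way of showing disk space; the user names, UIDs, and group names come from the task list.

```shell
#!/usr/bin/env bash
# account_setup_commands (hypothetical helper): print the commands for one node.
# Nothing is executed here; review the output, then run it as root on each node.
account_setup_commands() {
  cat <<'EOF'
useradd -u 2010 neymar
useradd -u 2016 ronaldo
groupadd barca
groupadd merengues
usermod -aG barca ronaldo
usermod -aG merengues neymar
grep -E '^(neymar|ronaldo):' /etc/passwd
grep -E '^(barca|merengues):' /etc/group
df -h
yum repolist enabled
EOF
}

account_setup_commands
```

After review, the same output can be piped to a root shell, for example `account_setup_commands | sudo sh`.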
Install MySQL
- Install a MySQL 5.6.x server on the node you listed first
- Copy `/etc/yum.repos.d/mysql-community.repo` to `challenges/labs/1_my-database-server.repo.md`
- Install the MySQL client package and JDBC connector jar on all clients
- Start the `mysqld` service
- Create the following databases
  - `scm`
  - `rman`
  - `hive`
  - `oozie`
  - `hue`
  - `sentry`
- Put the following in the file `challenges/labs/1_db-server.md`
  - The hostname of your MySQL node
  - The command and output for `mysql --version`
  - The command and output for listing MySQL databases
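
The database creation step above can be sketched as a small generator for the SQL. The helper name `create_db_sql` is hypothetical and the `utf8` default character set is an assumption, not something the task list specifies; the database names are the six listed above.

```shell
#!/usr/bin/env bash
# create_db_sql (hypothetical helper): print one CREATE DATABASE statement per
# database named in the task list. The character set is an assumption.
create_db_sql() {
  local db
  for db in scm rman hive oozie hue sentry; do
    printf 'CREATE DATABASE %s DEFAULT CHARACTER SET utf8;\n' "$db"
  done
}

create_db_sql

# To apply and then capture the evidence for 1_db-server.md (run on the MySQL node):
#   create_db_sql | mysql -u root -p
#   mysql --version
#   mysql -u root -p -e 'SHOW DATABASES;'
```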
HTML errors `intro.md`
`overview/intro.md` contains syntax errors in the `href` tags in the Course Objectives and Course Plan sections.
Installation Lab
CM Install Lab
- System Configuration Checks
- Configure MySQL with a replica server
- CM/CDH Install Lab
- Create a local parcel repo (manual) (BONUS)
Test HDFS
- As user `neymar`, use `teragen` to generate a 65,536,000-record dataset into eight files
  - Set the block size to 16 MB
  - Name the target directory `tgen640`
  - Use the `time` command to capture job duration
- Put the following in the file `challenges/labs/4_teragen.md`
  - The full `teragen` command
  - The output of the `time` command
  - The command and output of `hdfs dfs -ls /user/neymar/tgen640`
  - Show how many blocks are associated with this directory
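
As a sanity check on the block count asked for above, the expected number can be worked out before running the job. This sketch assumes `teragen` splits the rows evenly across the eight output files and that 16 MB means 16 MiB; `teragen` rows are 100 bytes each. The jar path in the comment is the usual parcel location and is an assumption.

```shell
#!/usr/bin/env bash
records=65536000
bytes_per_record=100                 # teragen writes 100-byte rows
files=8                              # one output file per map task
block_bytes=$((16 * 1024 * 1024))    # 16 MiB block size

bytes_per_file=$((records * bytes_per_record / files))
# Ceiling division: a partly filled last block still counts as a block.
blocks_per_file=$(( (bytes_per_file + block_bytes - 1) / block_bytes ))
total_blocks=$((blocks_per_file * files))

echo "bytes per file:       $bytes_per_file"
echo "blocks per file:      $blocks_per_file"
echo "expected data blocks: $total_blocks"

# A possible job invocation and block check (jar path is an assumed parcel default):
#   time hadoop jar /opt/cloudera/parcels/CDH/lib/hadoop-mapreduce/hadoop-mapreduce-examples.jar \
#     teragen -Ddfs.blocksize=16777216 -Dmapreduce.job.maps=8 65536000 /user/neymar/tgen640
#   hdfs fsck /user/neymar/tgen640 -files -blocks
```

Under these assumptions each file is 819,200,000 bytes and needs 49 blocks, for 392 data blocks in total; the empty `_SUCCESS` marker adds none.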
HUE Lab
- Authenticate using Linux users/groups
- Security or Availability?
CDH Security
- Implement TLS Level 1 Security
- Security Labs Preparation
- Integrating Kerberos with Cloudera Manager
- Sentry Lab
Update Cluster
- Update Cloudera Manager to the latest available release
  - Capture the command used to determine the API version available, along with the output.
  - Store them in the file `6_cm_latest.png`
- Upgrade CDH to the latest available release
  - Capture the CDH version and the services that are running
  - Store it in the file `6_cdh_latest.png`
Evaluation: Full Pass
Based on the labs and challenge work submitted, I am pleased to offer you a Full Pass. Congratulations!
You will receive an email notification when all the work submitted for the Madrid class has been processed.
Thank you for your hard work and diligence throughout the class.
Kerberize cluster
- Install an MIT KDC on the same node as the CM server
  - Name your realm after your GitHub handle
  - Use `ES` as a suffix
  - For example: `MFERNEST.ES`
- Create Kerberos principals for `neymar`, `ronaldo`, and `cloudera-scm`
  - Give `cloudera-scm` the privileges needed to create principals and keytabs
- Enable Kerberos for the cluster
- Run the `terasort` program as `neymar` using `/user/neymar/tsort640m`
  - Copy the command and output to `challenges/labs/5_terasort.md`
- Run the Hadoop `pi` program as the user `ronaldo`
  - Copy the command and output to `challenges/labs/5_pi.md`
- Copy only text files in `/var/kerberos/krb5kdc/` to your repo as follows:
  - Add the prefix `5_` and the suffix `.md`
  - Example: `5_kdc.conf.md`
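
The realm naming, principal creation, and file-copy steps above can be sketched as three small helpers. All three helper names are hypothetical. The ACL line printed at the end grants every privilege to `cloudera-scm`, which is an assumption and broader than strictly needed; a narrower permission set may be preferable. Detecting "text files" with `grep -I` is also an assumption about what the task intends.

```shell
#!/usr/bin/env bash
# realm_for (hypothetical helper): upper-case a GitHub handle and append .ES.
realm_for() {
  printf '%s.ES\n' "$(printf '%s' "$1" | tr '[:lower:]' '[:upper:]')"
}

# principal_commands (hypothetical helper): print the kadmin.local calls for
# the three principals. addprinc prompts for a password when actually run.
principal_commands() {
  local realm="$1" name
  for name in neymar ronaldo cloudera-scm; do
    echo "kadmin.local -q \"addprinc ${name}@${realm}\""
  done
}

# copy_kdc_text_files (hypothetical helper): copy only text files from a source
# directory into a destination directory, renaming NAME to 5_NAME.md.
# `grep -Iq .` succeeds only for non-empty files that grep does not treat as binary.
copy_kdc_text_files() {
  local src="$1" dest="$2" f
  for f in "$src"/*; do
    [ -f "$f" ] || continue
    if grep -Iq . "$f"; then
      cp "$f" "$dest/5_$(basename "$f").md"
    fi
  done
}

realm="$(realm_for mfernest)"
principal_commands "$realm"
# Assumed entry for /var/kerberos/krb5kdc/kadm5.acl (grants all privileges):
echo "cloudera-scm@${realm} *"
```

On the KDC node, `copy_kdc_text_files /var/kerberos/krb5kdc challenges/labs` would then produce files such as `5_kdc.conf.md` while skipping the binary principal database.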
Welcome to SEBC!
Your setup looks good so far. If you have not already invited `rsiwicki` as a Collaborator, please do so.
Once he has accepted, you may remove the `incomplete` label from this Issue and close it.
Good luck this week!