- Data wrangling
- Data cleaning
- SQL (MySQL)
- Python
- PowerBI
A dataset of HR in the United States of America from the year 2000 to 2020 was used. The SQL cleaning was based on a video by Inere Arch, with some differences. Python was another tool used to practice and observe the results. The challenger has 11 questions to answer (answered an be found in the files) after cleaning the data.
- What is the gender breakdown of employees in the company?
- What is the race/ethnicity breakdown of employees in the company?
- What is the age distribution of employees in the company?
- How many employees work at headquarters versus remote locations?
- What is the average length of employment for employees who have been terminated?
- How does the gender distribution vary across departments and job titles?
- What is the distribution of job titles across the company?
- Which department has the highest turnover rate?
- What is the distribution of employees across locations by state?
- How has the company's employee count changed over time based on hire and term dates?
- What is the tenure distribution for each department?
- There are more male employees - Males 50.97% x Females 46.28% x Non-Conforming 2.75%.
- The main ethnic group is White, representing 28.52% of the employees. The least dominant group is Native Hawaiian or Other Pacific Islanders, accounting for 5.44%.
- Numerous employees fall within the age range of 34-42, followed by 35-44, while the smallest group is within the range of 51-60.
- The oldest employee is 57 years old, and the youngest is 20.
- The average length of employment for terminated employees is approximately 7 and a half years.
- The number of employees has increased over the years.
- There are more employees working remotely, accounting for 74.99%, compared to those working in headquarters, which is 25.01%.
- The number of employees from Ohio is higher than the combined total from other states.
- Among all the employees' home states, Wisconsin has the fewest employees.
- The department with the highest turnover rate is Auditing, while the department with the lowest turnover rate is Marketing.
Project inspired by Irene (@herdataproject)