Skip to content

layamaryjoy/Employee-Data-Analysis-ABC-Company

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

30 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ“Š ABC Company Employee Data Analysis Project

Python Data Analysis Visualization Platform

A comprehensive Data Analytics & Visualization Project focused on extracting meaningful insights from employee data using Python.


πŸš€ Run Notebook in Google Colab

Click below to open the notebook:

Open In Colab


πŸ“˜ Project Overview

This project provides a detailed analysis of employee data from ABC Company.

  • Dataset contains 458 rows and 9 columns

  • Focuses on workforce insights, salary distribution, and team structure

  • Builds a data-driven report for business decision-making

🎯 Objective

The project includes:

πŸ”Ή Data Preprocessing

πŸ”Ή Exploratory Data Analysis (EDA)

πŸ”Ή Business-driven analytical tasks

πŸ”Ή Data visualizations

πŸ”Ή Insight generation & storytelling

πŸ”Ή Clean and reproducible Google Colab notebook


πŸ“‚ Dataset Description

Column Name Description
empid Employee ID
age Age of the employee
gender Gender
height Height (corrected during preprocessing)
weight Weight
team Department
position Job role
salary Monthly salary
experience Years of experience

🧹 Data Preprocessing

The following preprocessing steps were performed:

βœ” Ensured dataset consistency and quality

βœ” Standardized column names (lowercase, underscores)

βœ” Handled missing values

  • Numeric β†’ Median
  • Categorical β†’ Mode / "Unknown"

βœ” Removed duplicate records

βœ” Corrected height values (150–180 cm range)


πŸ“ˆ Analysis Tasks

πŸ” Task 1: Team Distribution

  • Employee count by team

  • Percentage distribution

  • Identified largest team

  • Visualized using bar & pie charts


πŸ§‘β€πŸ’Ό Task 2: Employee Position Segregation

  • Grouped employees by job role

  • Percentage distribution

  • Identified common roles

  • Visualized using count plot


πŸŽ‚ Task 3: Predominant Age Group

  • Created age groups (bins)

  • Counted distribution

  • Visualized using bar chart


πŸ’° Task 4: Salary Expenditure Analysis

  • Total salary by team

  • Total salary by position

  • Visualized using bar charts


πŸ“‰ Task 5: Age–Salary Relationship

  • Calculated correlation between age and salary

  • Visualized using scatter plot


🎨 Visualizations

Analysis Visualization
Team Distribution Bar chart, Pie chart
Position Segregation Count plot
Age Groups Bar chart
Salary Analysis Bar charts
Age vs Salary Scatter plot

βœ” Consistent theme, labels, and color palette applied across all charts


🧠 Key Insights

βœ” Majority of employees fall in the 25–35 age group

βœ” A positive correlation exists between age and salary

βœ” Senior roles contribute significantly to salary expenditure


πŸ“Œ Business Impact

This analysis helps organizations to:

  • Plan workforce distribution

  • Optimize salary budgets

  • Improve hiring strategies

  • Support data-driven decision-making


⚠️ Limitations

  • The dataset is relatively small (458 records), which may limit the depth of insights.

  • Data is static and does not reflect real-time workforce changes.

  • Some values (e.g., height) were artificially adjusted, which may affect analysis accuracy.

  • Limited features; important factors like performance ratings, department budgets, or education level are not included.

  • Correlation analysis does not imply causation between variables (e.g., age and salary).

  • Results may not generalize to other organizations with different workforce structures.

  • Visualization-based insights are descriptive and do not include predictive modeling.

πŸ›  Tech Stack

Tool Purpose
Python Programming language
Pandas Data processing
NumPy Numerical operations
Matplotlib / Seaborn Visualization
Google Colab Development environment

πŸ“ Repository Structure

ABC-Company-Employee-Analysis/

β”‚

β”œβ”€β”€ ABC Company.xlsx

β”œβ”€β”€ Python Module End Assessment 2.ipynb

β”œβ”€β”€ README.md


πŸš€ How to Run the Project

1️⃣ Open Notebook

Click the Google Colab link above

2️⃣ Run Cells

Execute all cells sequentially

3️⃣ View Results

Explore visualizations and insights


πŸ“Œ Academic Submission

This repository was created as part of a Python Module End Assessment in a Data Analytics program to demonstrate data preprocessing, exploratory data analysis (EDA), visualization techniques, and deriving business insights from employee data.


πŸ‘€ Author

Name: Laya Mary Joy

Organization: Entri Elevate

Date: January 15, 2026


⭐ Acknowledgment

Thanks to Entri Elevate for guidance and support.


πŸ“Œ Future Improvements

  • Add interactive dashboards (Power BI / Streamlit)

  • Include advanced statistical analysis

  • Integrate real-time employee datasets

  • Enhance visualization interactivity


About

Comprehensive analysis of ABC company employee data, including team distribution, position segregation, age demographics, salary expenditure, and correlations, with visualizations to support workforce insights.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors