Tools: Jupiter Notebook / Python / Pandas
Objectives
1. Identify the most popular programming languages among IT professionals.
2. Analyze average salaries and income statistics for IT professionals.
3. Explore the age distribution among IT professionals.
4. Provide statistical information on working hours for part-time and full-time IT professionals.
5. Determine the most popular databases among IT professionals.
6. Examine the relationship between income and various factors such as working hours, age, education, and other variables.
Tools:
Python, Jupyter notebook
Libraries
numpy , pandas , seaborn , matplotlib.pyplot
Dataset:
Our data set is a survey works among IT professional , collected and published on Github in the link below.
It has 11551 records and 84 columns. (11552, 85)
1. Identify the most popular programming languages among IT professionals.
There are unique 28 programming languages , the top most popular ones are:
- JavaScript
- HTML/CSS
- SQL
- Bash/Shell/PowerShell
- Python

2. Analyze average salaries and income statistics for IT professionals.
3. Explore the age distribution among IT professionals.
Summary statistics.
- Mean: 30.77
- Std (standard deviation): 7.37
- Min: 16.00
- 25% (first quartile, Q1): 25.00
- 50% (median, Q2): 29.00
- Max: 72.00
- 75% (third quartile, Q3): 35.00

4. Provide statistical information on working hours for part-time and full-time IT professionals.

Reading into dataset:
The Max of 1,012.00 hours is mathematically impossible (there are only 168 hours in a week). This indicates “noisy” data or data entry errors in the dataset.
Similar to the full-time data, the Max of 375.00 hours is impossible for a single week, suggesting errors in the source data.
Full-time workers are a much larger group in this data and center strictly around a 40-hour week. Part-time workers have a much broader distribution relative to their average, typically working between 20 and 35 hours.
After applying outlier treatment:


5. Determine the most popular databases among IT professionals.
There are 13 unique databases , the top most popular ones are:
MySQL
Microsoft SQL Server
PostgreSQL
SQLite
MongoDB

6. D Examine the relationship between income and various factors such as working hours, age, education, and other variables.
The pair plot shows that experience‑related variables are tightly correlated, compensation rises only weakly with experience, and tool knowledge scales modestly — highlighting that experience is consistent, but not a strong standalone predictor of pay


Leave a Reply
You must be logged in to post a comment.