The Palos Publishing Company

Follow Us On The X Platform @PalosPublishing
Categories We Write About

How to Study the Impact of New Technologies on Job Creation Using Exploratory Data Analysis

Studying the impact of new technologies on job creation using Exploratory Data Analysis (EDA) involves a multi-step process of collecting relevant data, preprocessing it, and visualizing patterns and trends that inform conclusions. EDA helps uncover the underlying structure of data and identify significant correlations and changes in employment patterns due to technological advancement. This approach does not establish causality but provides valuable insights and direction for more rigorous statistical or econometric analysis.

1. Define the Scope and Objective

The first step is to clearly define what constitutes “new technologies” and how job creation is measured. This includes:

  • Technologies of Interest: Artificial Intelligence (AI), Machine Learning (ML), Robotics, Blockchain, Internet of Things (IoT), Automation, etc.

  • Job Metrics: Employment rates, new job titles or roles, wage changes, job vacancies, and labor productivity.

Define whether the analysis will be industry-specific (e.g., manufacturing, healthcare, IT) or broader. Also determine the geographic focus — global, national, or regional.

2. Data Collection

To conduct an effective EDA, compile datasets from various sources:

  • Government and International Bodies: Bureau of Labor Statistics (BLS), OECD, World Bank, Eurostat.

  • Job Market Platforms: LinkedIn, Glassdoor, Indeed (job postings, role descriptions).

  • Tech Investment Sources: Crunchbase, Statista (start-up funding, adoption rates).

  • Patent Databases: WIPO, USPTO (to track innovation).

  • Industry Reports: McKinsey, PwC, Deloitte (technology adoption trends and forecasts).

Ensure the data is longitudinal to analyze trends over time and include demographic attributes (age, education level, location) for segmentation.

3. Data Cleaning and Preprocessing

Raw data is rarely clean. Preprocessing steps involve:

  • Handling Missing Values: Use imputation techniques or remove incomplete rows.

  • Standardization: Normalize employment figures and technology adoption rates to allow comparison.

  • Categorization: Group jobs into sectors, skill levels, or job families.

  • Time Series Formatting: Ensure all data have consistent time stamps for temporal analysis.

Data preprocessing is crucial to improve the quality and interpretability of EDA.

4. Feature Engineering

Create new variables that can better reflect the influence of technology on jobs:

  • Tech Exposure Index: Percentage of tasks within jobs that can be automated or are aided by technology.

  • Job Creation Rate: Net increase in number of jobs per sector over a period.

  • Technology Investment Ratio: Investment in tech tools per employee in a sector.

  • Adoption Speed: Years since adoption of specific technologies in various industries.

These features help bridge the gap between abstract technology indicators and measurable employment outcomes.

5. Descriptive Statistics and Correlation Analysis

Use basic statistical measures to summarize data:

  • Central Tendency: Mean and median job growth in tech vs. non-tech sectors.

  • Dispersion: Standard deviation of employment growth across industries.

  • Skewness: Whether job creation is heavily concentrated in few sectors.

Apply correlation analysis (Pearson or Spearman) between:

  • Technology adoption rates and job creation

  • R&D spending and employment in high-skill sectors

  • Automation level and displacement rates in low-skill jobs

This step helps understand linear or monotonic relationships.

6. Data Visualization for Pattern Recognition

Visualization is the core of EDA. It helps detect patterns, trends, and anomalies:

  • Line Charts: Show job creation over time by sector with technology milestones marked.

  • Bar Charts: Compare job growth in tech-intensive vs. traditional sectors.

  • Heatmaps: Highlight correlation between variables (e.g., AI adoption vs. employment rate).

  • Boxplots: Show distribution of wage growth in jobs impacted by automation.

  • Scatter Plots: Explore relationship between startup funding in AI and new tech job listings.

  • Stacked Area Charts: Reveal shifts in job categories over time.

Using interactive dashboards (e.g., with Tableau or Plotly) can aid dynamic exploration of the data.

7. Segmentation Analysis

Different groups may experience technology’s impact differently. Segment data by:

  • Industry: Compare manufacturing, retail, healthcare, IT.

  • Job Type: Routine vs. non-routine, cognitive vs. manual.

  • Skill Level: Low-skilled vs. high-skilled jobs.

  • Region: Urban vs. rural, developed vs. developing countries.

  • Demographics: Age, gender, education.

This granular view provides insights into inequality or differentiated effects of new technologies on employment.

8. Trend Analysis and Time Series Decomposition

To understand temporal dynamics:

  • Decompose Time Series: Separate employment data into trend, seasonal, and irregular components.

  • Moving Averages: Smooth short-term fluctuations to identify long-term trends.

  • Change Point Detection: Identify when structural shifts occurred in employment data.

This can help pinpoint if technological breakthroughs coincide with shifts in job creation.

9. Text Mining of Job Descriptions and Patents

Natural Language Processing (NLP) techniques can be used on unstructured data:

  • Topic Modeling: Identify emerging job roles or skills from job postings.

  • Keyword Frequency: Track growth in demand for tech-related skills.

  • Semantic Analysis: Understand how roles evolve with tech incorporation.

Patent text analysis can also reveal the nature of innovations influencing labor markets.

10. Cluster Analysis

Unsupervised learning methods like K-means or hierarchical clustering can group similar industries or regions based on their tech adoption and job growth profiles. For example:

  • High adoption–high job creation

  • High adoption–low job creation (potential job displacement)

  • Low adoption–moderate job stagnation

This helps identify archetypes and outliers for deeper qualitative investigation.

11. Cohort and Transition Analysis

Evaluate how individuals or job categories transition due to technology:

  • Cohort Tracking: Follow graduates of tech-related fields and their employment trajectories.

  • Job Transitions: Map how roles evolve — e.g., administrative assistants transitioning to project coordinators due to automation.

  • Skill Shifts: Measure how required skills for the same job change over time.

This analysis highlights adaptive pathways and potential reskilling opportunities.

12. Limitations of EDA in this Context

While EDA is powerful for pattern recognition, it has limitations:

  • Correlation ≠ Causation: EDA shows associations, not causal links.

  • Bias in Data: Job postings might not represent actual employment; missing informal economy.

  • Lag Effects: Tech adoption may have delayed impacts not captured in short-term data.

  • Dynamic Interactions: Technological impact can vary depending on regulation, culture, or education systems.

These challenges should guide the interpretation of findings and encourage further hypothesis testing.

13. Recommendations for Further Research

EDA can guide deeper analysis by pointing to promising hypotheses. For instance:

  • Use econometric models to establish causality between technology investment and employment changes.

  • Design longitudinal studies tracking workers’ career paths post-automation.

  • Develop predictive models for sectors at risk or ripe for job growth due to emerging tech.

14. Conclusion

Exploratory Data Analysis provides a powerful toolkit to uncover the multifaceted impact of new technologies on job creation. By integrating diverse datasets, crafting meaningful features, and applying insightful visualizations, researchers and policymakers can better understand how technological advancements shape the future of work. Though EDA does not confirm causality, it serves as a critical first step toward evidence-based strategies for managing technological transitions in the labor market.

Share this Page your favorite way: Click any app below to share.

Enter your email below to join The Palos Publishing Company Email List

We respect your email privacy

Categories We Write About