How to Study the Impact of COVID-19 on Global Supply Chains Using EDA

Studying the impact of COVID-19 on global supply chains through Exploratory Data Analysis (EDA) involves using a data-driven approach to understand and visualize the disruptions that occurred. EDA helps uncover hidden patterns, correlations, and outliers, which is crucial for understanding how the pandemic affected supply chains. The following steps outline how to conduct this analysis:

1. Collect Relevant Data

The first step is to gather data that reflects various aspects of global supply chains. Sources of this data can include:

Trade and logistics data: Port traffic, shipment volumes, customs data, and international trade flows. These can provide insights into the movement of goods across borders.
Economic indicators: GDP, unemployment rates, inflation rates, and other economic indicators can show how economic activity was impacted during the pandemic.
Industry-specific data: Data on industries such as manufacturing, retail, and agriculture can help examine how specific sectors were affected by disruptions.
COVID-19 infection and response data: Cases, vaccination rates, and government measures (lockdowns, quarantines) can help correlate the pandemic’s direct impact on supply chains.
Company-level data: Financial reports, production volumes, supply chain interruptions, and delays from major companies (e.g., Amazon, Apple, etc.).

Data sources can be obtained from governmental agencies, supply chain companies, economic databases, and even scraping relevant websites and reports.

2. Data Preprocessing

Once the data is collected, it’s essential to preprocess it to ensure its usability for EDA. This involves:

Cleaning the data: Handling missing values, removing duplicate records, and correcting erroneous data points.
Normalization and scaling: For example, trade volumes could span several orders of magnitude, so normalizing or scaling the data ensures all features are on the same level.
Feature engineering: Creating new features that could offer better insights, like calculating weekly or monthly changes in supply chain metrics.

3. Visualizing Key Metrics

Using visualizations to explore the data is the heart of EDA. Different visual tools help detect trends and anomalies:

Time series analysis: Plotting trade volumes, port traffic, or production levels over time can help visualize disruptions. Comparing periods before and after key events, such as the initial lockdowns, can provide insights.
Heatmaps: Heatmaps can help visualize regional or global supply chain disruptions. For example, which countries were affected more or less by trade disruptions, or where delays were most pronounced.
Bar and line charts: These can be used to compare industry-specific data such as inventory levels, production rates, and sales volumes across different periods.
Scatter plots: Use scatter plots to explore correlations, like the relationship between COVID-19 case numbers and shipping delays or how economic indicators correlate with trade disruptions.

4. Correlation Analysis

Exploring correlations between different variables is essential for understanding how COVID-19 impacted supply chains. For instance:

Case numbers and shipping delays: Analyzing the correlation between COVID-19 case rates and the frequency or length of supply chain delays.
Economic indicators and supply chain performance: Understanding how GDP changes, unemployment rates, and inflation are correlated with disruptions in supply chains.
Government response measures and logistics: Analyzing how government-imposed restrictions such as lockdowns, curfews, and border closures affected trade.

Tools like Pearson or Spearman correlation can quantify these relationships.

5. Anomaly Detection

Using EDA for anomaly detection is crucial in understanding abnormal supply chain behavior during the pandemic. Some key techniques include:

Z-scores: Calculate the Z-score to identify data points that significantly deviate from the average. This could highlight sudden disruptions like spikes in delays or abnormal changes in inventory levels.
Box plots: A box plot can help identify outliers in the data. For instance, a sudden drop in production levels or a significant change in delivery times could be flagged as anomalies.

6. Segmentation Analysis

In some cases, it’s useful to segment the data into subgroups based on certain criteria, such as:

Geographic segmentation: Compare supply chain disruptions in different regions or countries to see how global versus regional supply chains were affected.
Sector-specific segmentation: Different industries (manufacturing, retail, agriculture) experienced varying degrees of disruption. Segmenting the data by industry allows for a detailed understanding of the specific challenges each faced.
Supplier vs. Consumer segmentation: Some supply chains were more focused on suppliers, while others were more consumer-centric. Analyzing these separately helps to understand the differing impacts on upstream and downstream supply chain components.

7. Time Lag Analysis

The pandemic’s impact was not instantaneous. Delays or disruptions often occurred weeks after the initial lockdowns. Therefore, understanding the time lag between COVID-19 outbreaks and the effects on supply chains is crucial. Some ways to study this include:

Lag analysis: Use time-series data to observe how delays or disruptions in one part of the supply chain (like production shutdowns in China) ripple through the global supply chain over weeks or months.
Cross-correlation: This technique can show how changes in one variable, such as COVID-19 case numbers, are related to changes in another variable, such as shipping times.

8. Clustering and Grouping

Clustering techniques, such as K-means or hierarchical clustering, can be used to group regions, industries, or companies with similar supply chain behaviors. For example:

Cluster analysis of disruptions: Identify which regions or industries experienced similar levels of disruption, and group them accordingly. This can help policymakers and businesses understand which sectors or areas may need more targeted interventions in future crises.

9. Hypothesis Testing

At this stage, you may want to test hypotheses about the impact of COVID-19 on supply chains. For example:

Did countries with stricter lockdowns experience more severe disruptions in global trade?
Was the electronics industry more or less resilient to disruptions compared to the food industry?

Statistical tests like T-tests, chi-square tests, and ANOVA can be used to test these hypotheses based on the EDA findings.

10. Drawing Conclusions and Reporting Insights

The final step is to synthesize the insights derived from the data and make conclusions about the impact of COVID-19 on global supply chains. This might include:

Highlighting the most severely affected sectors or regions.
Identifying the key drivers of disruptions, such as port congestion, labor shortages, or raw material shortages.
Offering recommendations for supply chain resilience, such as diversification of suppliers, increasing stockpiles, or investing in digital technologies.

This process of using EDA helps ensure that the conclusions are backed by data and can be used for future planning, policy-making, and industry strategy.

Conclusion

EDA is an invaluable tool for studying the complex impacts of COVID-19 on global supply chains. By using a combination of data collection, visualization, statistical analysis, and segmentation techniques, businesses and policymakers can gain a deep understanding of the disruptions and design strategies to mitigate the impact of future global crises on supply chains.

Share This Page: