The Palos Publishing Company

Follow Us On The X Platform @PalosPublishing
Categories We Write About

How to Use EDA to Analyze the Relationship Between Social Movements and Public Policy

Exploratory Data Analysis (EDA) is a powerful method in data science used to uncover patterns, spot anomalies, and test hypotheses through statistical graphics and data visualization techniques. When applied to the study of social movements and public policy, EDA can offer crucial insights into how grassroots activism influences legislative change. This article provides a structured approach to using EDA for analyzing the dynamic relationship between social movements and public policy, emphasizing tools, techniques, and real-world applications.

Understanding the Interplay Between Social Movements and Policy

Social movements are collective efforts aimed at promoting or resisting change in society, often focused on issues such as civil rights, environmental justice, or gender equality. Public policy, on the other hand, encompasses the laws, regulations, and government actions that reflect societal values and power structures.

The relationship between the two is complex: social movements can shape policy by raising awareness, influencing public opinion, and pressuring policymakers. Conversely, policy decisions can spark or reshape social movements. Analyzing this bidirectional relationship using EDA involves identifying trends, correlations, and temporal changes in datasets related to protests, legislation, social media discourse, and public opinion polls.

Step 1: Collecting and Preparing the Data

Data Sources

To begin an EDA project on social movements and public policy, data must be sourced from a range of platforms:

  • Protest datasets: e.g., the Crowd Counting Consortium or Global Database of Events, Language, and Tone (GDELT).

  • Legislative records: such as government websites (e.g., congress.gov), OpenStates, or local policy tracking databases.

  • Social media data: using APIs from platforms like Twitter or Reddit to track movement hashtags and discussions.

  • News articles and media coverage: via aggregators and natural language processing tools.

  • Public opinion surveys: from sources like Pew Research Center or Gallup.

Data Cleaning and Preprocessing

Before analysis, data must be cleaned:

  • Standardize formats (e.g., dates, locations).

  • Remove duplicates or irrelevant entries.

  • Address missing data through imputation or elimination.

  • Aggregate data at a suitable level (e.g., daily protest counts, monthly legislative changes).

Step 2: Defining Analytical Objectives

EDA is most effective when guided by specific questions, such as:

  • Do spikes in protest activity correlate with legislative proposals?

  • What time lag exists between movement activity and policy response?

  • Which types of movements are most successful in driving policy change?

  • How does media attention mediate between activism and policy shifts?

Setting clear objectives helps in choosing the right visualization and statistical tools.

Step 3: Performing Exploratory Data Analysis

Univariate Analysis

Start with descriptive statistics and visualizations for individual variables:

  • Protest frequency over time (line plots).

  • Number of bills proposed per year (bar charts).

  • Distribution of protest sizes (histograms).

Bivariate and Multivariate Analysis

To explore relationships between variables:

  • Correlation analysis: Use Pearson or Spearman correlation coefficients to examine links between protest frequency and number of policy changes.

  • Scatter plots: Visualize relationships between protest intensity and legislative activity.

  • Time series analysis: Overlay protest timelines with legislative events to detect lagged effects.

  • Heatmaps: Show how public sentiment and media coverage vary with movement size or legislative response.

Text and Sentiment Analysis

For qualitative data such as tweets, news headlines, and legislative texts:

  • Word clouds and term frequency-inverse document frequency (TF-IDF) to identify key themes.

  • Sentiment analysis to measure the tone of public and political discourse.

  • Topic modeling using Latent Dirichlet Allocation (LDA) to categorize protest demands and legislative rhetoric.

Step 4: Case Studies and Comparative Analysis

Apply EDA to specific historical or contemporary movements for deeper insight:

  • Black Lives Matter (BLM): Analyze protest data from 2020 alongside police reform bills.

  • Women’s March: Correlate participation rates with gender-related policy initiatives.

  • Climate activism: Evaluate the impact of Extinction Rebellion on environmental legislation.

Cross-comparison of different movements can reveal patterns of success or stagnation and highlight effective advocacy strategies.

Step 5: Geospatial Analysis

Mapping tools can add another layer of understanding:

  • Use choropleth maps to visualize protest density by region and overlay with legislative activity.

  • Perform spatial clustering to detect hotspots of activism or policy responsiveness.

  • Combine geographic data with demographic variables to analyze representation and policy reach.

Step 6: Identifying Trends and Drawing Conclusions

Use the insights from EDA to draw preliminary conclusions:

  • Identify leading indicators of policy change, such as social media surges or coordinated protest days.

  • Spot temporal gaps between activism and policy response to guide strategic planning.

  • Highlight disparities in policy responsiveness by location, movement type, or media attention.

These trends form the basis for deeper inferential analysis or predictive modeling.

Step 7: Communicating Insights

Effective communication is crucial for translating EDA findings into actionable insights:

  • Use interactive dashboards (e.g., Tableau, Power BI, Plotly) for stakeholders to explore the data.

  • Present story-driven visualizations that narrate the evolution of a movement and its policy impact.

  • Tailor reports to audiences—academics, activists, or policymakers—by focusing on relevance and clarity.

Tools and Technologies

Several tools can facilitate the EDA process:

  • Python libraries: pandas, NumPy, Matplotlib, Seaborn, Plotly, spaCy, NLTK.

  • R packages: dplyr, ggplot2, tidytext, lubridate, leaflet.

  • APIs and data platforms: Twitter API, NewsAPI, OpenSecrets, Google Trends.

  • Geospatial tools: QGIS, GeoPandas, Mapbox.

Challenges and Limitations

While EDA provides valuable initial insights, several challenges may arise:

  • Data bias: Media or social media may overrepresent certain movements or demographics.

  • Causality vs correlation: EDA reveals associations, not causal relationships.

  • Data sparsity or noise: Inconsistent reporting or vague protest goals can obscure analysis.

  • Legal and ethical concerns: Privacy, consent, and misinformation must be managed, especially with social media data.

Conclusion

Exploratory Data Analysis is an essential tool for researchers, policymakers, and activists seeking to understand how social movements affect public policy. By integrating diverse datasets, using robust visualization techniques, and maintaining a critical perspective, EDA helps uncover the nuanced ways in which collective action shapes the legislative landscape. Though it is only the first step in a comprehensive analytical pipeline, it provides the foundation for deeper understanding and more informed advocacy.

Share this Page your favorite way: Click any app below to share.

Enter your email below to join The Palos Publishing Company Email List

We respect your email privacy

Categories We Write About