blog insightscommunity

DrivenData 10-Year Impact Report: Three pathways to creating social impact with data science and AI

An overview of how DrivenData’s impact is built through projects, portfolios, and people working together.

Tom Harrington
Director of Growth

Since 2014, DrivenData has invested in three reinforcing pathways to use data science and AI for social impact. The latest DrivenData Impact Report frames this clearly: it shows how progress comes through a combination of projects, portfolios, and people, each building on the other.

Projects

The primary driver of social impact is through individual projects.

At the project level, impact is most tangible. Projects take messy, real-world problems and turn them into deployable solutions.

“Our founding vision was to apply data science and machine learning to specific problems faced by mission-driven organizations so they could expand their social impact.”
— Peter Bull, Co-Founder

Through more than 160 projects to date, DrivenData has collaborated with mission-driven organizations to develop needed data, AI models, and tools and prove their value by improving health diagnoses, classroom learning experiences, delivery logistics, and environmenmtal monitoring.

Beluga whale identification visualization
DrivenData competition solvers developed models to re-identify individual Beluga whales, enabling noninvasive monitoring. Explainability heatmaps show distinctive dorsal ridge features (highlighted in yellow/green). Image source: Competition winner Raphael Kiminya

Portfolios

The cross-fertilization of ideas and insights in a portfolio of projects is where social impact compounds.

The real leverage comes when individual projects connect. Techniques developed in one domain—such as computer vision, predictive modeling, or advances in automated speech recognition—are deployed in entirely different contexts. For example, DrivenData advancements in named entity recognition and image processing are contributing to diverse areas such as kelp forest mapping, cancer biopsy analysis, agricultural pest detection, and measuring particulate matter in the air.

“From the beginning… what we were ultimately trying to maximize was the combined and compounding impact of our entire portfolio of projects.” — Isaac Slavitt, Co-founder

Over time, DrivenData has harnessed the compounding knowledge embodied in its growing portfolio of work, such that each new effort starts further ahead than the last.

Zamba Cloud screenshot showing species prediction and bounding box detection
A screenshot from Zamba Cloud, showing the species prediction and bounding box detection for an image. This application, developed by DrivenData, enables conservationists to train and run machine learning models used to process camera trap data. It is representative of a diverse portfolio of overlapping work by the DrivenData team focused on image and video processing. Image source: DrivenData

People

Open sharing of data, knowledge, and insight creates the human and social capital that drives progress in the data science for social good ecosystem.

Advancements in technology and methods are magnified when they are available to the practitioner community. Empowering the global network of practitioners with open-source tools, data, and shared learnings turns isolated work into a movement—making it easier for others to contribute, adopt, and build on it.

“Investment in the social impact ‘commons’ of shared knowledge, resources, and open source data and tools… is one of the most powerful engines for progress in our field.”
— Greg Lipstein, Co-founder

From its founding, DrivenData has committed to open-sourcing data and tools and sharing knowledge.

Messaging platforms for agricultural entity recognition
Automating the recognition of agricultural entities (such as crops, pests, diseases, and chemicals) in WhatsApp and Telegram messages among plant doctors, to surface emerging trends and threats. Integrated into our work were trainings on reproducible data science, technical guides for git and GitHub, and best practices for project management of technical work. The result of the project was not just trained entity extraction models but also an up-skilled team.

The DrivenData 10-Year Impact Report

How these three pathways—projects, portfolios, and people—reinforce each other, and the full story of how they play out in practice is captured in detail in DrivenData’s 10-year Impact Report. We invite you to dig in and reflect on your own experience.

Impact report tiles

Stay updated

Join our newsletter or follow us for the latest on our social impact projects, data science competitions and open source work.

There was a problem. Please try again.
Subscribe successful!
Protected by reCAPTCHA. The Google Privacy Policy and Terms of Service apply.

Latest posts

All posts

insights

DrivenData 10-Year Impact Report: Three pathways to creating social impact with data science and AI

An overview of how DrivenData’s impact is built through projects, portfolios, and people working together.

tutorial

Improving Automatic Speech Recognition for Kids - A Reference Implementation for Phonetic-level Transcription

A step-by-step guide to training a model to predict phonetic symbols for the On Top of Pasketti Challenge (Phonetic Track)

tutorial

Improving Automatic Speech Recognition for Kids - A Reference Implementation for Word-level Transcription

Learn how to train a model to transcribe child speech for the On Top of Pasketti Challenge (Word Track)

insights

5 Challenges of Creating Beautiful Data Pipelines

A look into the hidden complexity of data pipelines, and some suggestions to improve the process.

insights

AI Agents in Data Science Competitions: Lessons from the Leaderboard

How good are AI agents at data science? Here's what we've learned from initial experiments about what works, what doesn't, and what the future might hold.

case studies

Linking nonprofit grants to organizations with machine learning

DrivenData built Orgmatch, a scalable and explainable entity resolution system to add value to information processed by a leading nonprofit data hub.

insights

Bringing small water bodies into view: Sentinel-2 satellite monitoring of harmful algal blooms (HABs)

CyFi enhances modern HAB monitoring programs by extending their reach and informing field-based components.

insights

Solving the last-mile public data problem

Using "baked" data to transform public data repositories into analysis-ready resources

media

DrivenData Joins U.S. Department of Energy's Genesis Mission to Advance AI for Science and the Public Good

Social impact data science organization brings decade of federal open innovation experience to historic national initiative

winners

Meet the winners of Phase 3 of the PREPARE Challenge

Learn how teams developed proof-of-concept approaches for real-world early Alzheimer's prediction

winners

Meet the winners of the AI for Advancing Instruction Challenge

Learn how the winners of the AIAI challenge leveraged multimodal classroom data to identify instructional activities and classroom discourse content.

case studies

Automating wildlife monitoring with Zamba & Zamba Cloud

DrivenData partnered with conservation researchers to create Zamba, an open-source machine learning solution that helps wildlife researchers process camera trap footage, reducing months of manual review to hours of automated analysis.

community

Community Spotlight: Paola Ruiz, Néstor González, Daniel Crovo

The Community Spotlight features fantastic members from our DrivenData community. Three members of the IGCPHARMA team, Paola Ruiz, Néstor González, and Daniel Crovo talk to us about data science, drug discovery, diverse databases and more!

community

Community Spotlight: Kirill Brodt

The Community Spotlight features fantastic members from our DrivenData community. Kirill Brodt, a researcher in computer graphics at the University of Montreal, talks animation, pose estimation, and data science challenges.

case studies

Jump-starting data infrastructure and in-house data expertise

DrivenData designed and built a data warehouse to centralize, organize, and visualize data across CodePath's operations. Our team also provided technical hiring assistance to find the right talent to carry the work forward.

case studies

A production application to support survivors of human trafficking

DrivenData developed Freedom Lifemap, a digital tool designed to support survivors of human trafficking on their journey toward reintegration and independence.

insights

Life beyond the leaderboard

What happens to winning solutions after a machine learning competition?

insights

(Tech) Infrastructure Week for the Nonprofit Sector

Reflections on how to build data and AI infrastructure in the social sector that serves the needs of nonprofits and their beneficiaries.

winners

Meet the winners of Phase 2 of the PREPARE Challenge

Learn about how winners detected cognitive decline using speech recordings and social determinants of health survey data

insights

AI sauce on everything: Reflections on ASU+GSV 2025

Data, evaltuation, product iteration, and public goods: reflections on the ASU+GSV Summit 2025.

Work with us to build a better world

Learn more about how our team is bringing the transformative power of data science and AI to organizations tackling the world's biggest challenges.