Introducing DrivenData

As we begin launching our first competitions, we thought it would be a good idea to lay out what exactly we're trying to do and why.

If your goal is to change the future, it helps to have really good predictions about what that future looks like.

And there are a lot of people interested in changing the future. Amazon wants to increase the number of goods you order through their site, so they predict which ones you might want to buy next and when. Twitter wants to boost your use of their platform, so they predict which tweets you will ignore and which you will engage with. Facebook and Google want to increase the number of ads you click on their sites, so they predict your personal click-through behavior. They have gotten very good at making these predictions.

But there are many other reasons to want to change the future. Educators want to increase the number of students graduating high school. Health workers want to improve the overall health of a population at a sustainable cost. Microlenders want to give more individuals in the developing world a chance to pursue their dreams without incurring default. Conservationists want to curb our energy usage without hampering productivity. Governments want to prevent fires from destroying lives and property.

This is where we come in. In the quest to make these difficult but important changes, we must arm ourselves with state-of-the-art predictions. Predict which students are likely to drop out before graduation back when they are in junior high, so teachers can intervene earlier. Predict which individuals will be able to repay their microloans, even when banks have shut them out in the past. Predict where fires are more likely to break out, and get there first.

Source: flickr user thenationalguard
A different kind of big data conflagration.

In today's world, the people who can make these predictions better than anyone else are data scientists. They are the modern day fortune tellers, but instead of crystal balls they wield datasets. Armed with skills in statistics and computer science, the data scientist takes large data sets and builds smart, creative, flexible models for what is likely to happen next. In 2011, there was more data produced than in all the previous years of human history combined. The quantity and variety of data available to us is exploding, and the individuals who can manipulate and illuminate these data have incredible value to offer.

At DrivenData, we want to bring cutting-edge practices in data science and crowdsourcing to some of the world's biggest social challenges and the organizations taking them on. We host online challenges, usually lasting 2-3 months, where a global community of data scientists competes to come up with the best statistical model for difficult predictive problems that make a difference.

Just like every major corporation today, nonprofits and NGOs have more data than ever before. And just like those corporations, they are trying to figure out how to make the best use of their data. We work with mission-driven organizations to identify specific predictive questions that they care about answering and can use their data to tackle.

Source: flickr user plenty
In predictive modeling, trying many different approaches is crucial.

Then we host the online competitions, where experts from around the world vie to come up with the best solution. Some competitors are experienced data scientists in the private sector, analyzing corporate data by day, saving the world by night, and testing their mettle on complex questions of impact. Others are smart, sophisticated students and researchers looking to hone their skills on real-world datasets and real-world problems. Still more have extensive experience with social sector data and want to bring their expertise to bear on new, meaningful challenges - with immediate feedback on how well their solution performs.

Like any data competition platform, we want to harness the power of crowds combined with the increasing prevalence of large, relevant datasets. Unlike other data competition platforms, our primary goal is to create actual, measurable, lasting positive change in the world with our competitions. At the end of each challenge, we work with the sponsoring organization to integrate the winning solutions, giving them the tools to drive real improvements in their impact.

"The best minds of my generation are thinking about how to make people click ads. That sucks." — Jeff Hammerbacher, 2011

Equipped with good predictions, we have opportunities to change the course of our planet that we've never had before. We want to tackle the hairiest, most challenging, and most meaningful problems in the world today. We are building a community of data experts that can take them on. This is the new frontier of social good.

Getting involved

We are launching soon and we want you to join us!

If you want to get updates about our launch this fall with exciting, real competitions, please sign up for our mailing list here and follow us on Twitter: @drivendataorg.

If you are a data scientist, feel free to create an account and start playing with our first sandbox competitions.

If you are a nonprofit or public sector organization, and want to squeeze every drop of mission effectiveness out of your data, check out the info on our site and let us know!


Peter, Greg, and Isaac
The DrivenData Team