Tools posts

Explore the latest in our mission to build a better world using data science and AI.

Content list

tools

Cookiecutter Data Science V2

Announcing the V2 release of Cookiecutter Data Science, the most widely adopted data science project template.

tools

Introducing Community Code

View and share helpful code related to a competition in the new Community Code section!

tools

The Basics of Python Packaging in Early 2023

Explaining the basic concepts and best practices for creating Python packages in early 2023 using pyproject.toml build standards.

tools

A peek inside DrivenData's code execution competitions

Code execution competitions allow participants to run their code on unseen test sets and have their best scores displayed on a live leaderboard. Read on to learn more about the what, why, and how of DrivenData code execution competitions!

tools

Easier Code Reviews for Jupyter Notebooks with nbautoexport

Learn how DrivenData uses nbautoexport, an open-source tool we developed, to make Jupyter Notebook code reviews easier.

tools

The missing guide to AzureML, Part 3: Connecting to data and running your machine learning pipeline

Use what you've learned about AzureML to create and run your first machine learning pipeline in the AzureML cloud.

tools

The missing guide to AzureML, Part 2: Configuring your compute script and compute target

Configure cloud hardware and software to run your machine learning code.

tools

The missing guide to AzureML, Part 1: Setting up your AzureML workspace

Get acquainted with core Azure and AzureML concepts, and set up your first AzureML workspace.

tools

Our Minimum Viable Process for changing and deploying software

Over the years, we have gradually settled on a process for changing and releasing software that is robust enough to prevent common failure modes but lightweight enough to be minimally annoying. Here’s what it looks like when we make a change.

tools

Concept To Clinic tools highlight: Travis CI

Continuous integration is a best practice for software development. We think it's a best practice for data science, too.

tools

Concept To Clinic tools highlight: FOSSA

Complying with open source licenses is difficult. Today we're talking about a tool that has made it easier for the Concept to Clinic challenge.

tools

How the Concept to Clinic challenge uses Docker for reproducibility and orchestration

Ensuring easy setup of the application stack and efficient segmentation of the various components of the Concept to Clinic challenge is precisely where Docker excels, especially when used with Docker Compose, its tool for declaring and orchestrating multiple services. In this post, we'll dive into some background on Docker and explain how we're using it in the Concept to Clinic challenge.

Stay updated

Join our newsletter or follow us for the latest on our social impact projects, data science competitions and open source work.

There was a problem. Please try again.
Subscribe successful!
Protected by reCAPTCHA. The Google Privacy Policy and Terms of Service apply.

Work with us to build a better world

Learn more about how our team is bringing the transformative power of data science and AI to organizations tackling the world's biggest challenges.