Interview with Concept to Clinic contributor Willi Gierke (@WGierke)

We had an e-chat to learn more about prolific Concept to Clinic challenge contributor Willi Gierke (@WGierke)

This week, we had an e-chat to learn more about prolific Concept to Clinic challenge contributor Willi Gierke (@WGierke).

Hi Willi, who are you and what do you do professionally?

I'm a student studying IT-Systems Engineering at Hasso Plattner Institute in Potsdam, Germany. Currently, I'm entering my second Master's semester. I fell in love with data science during my bachelor thesis when I figured out how relatively simple it is to achieve artificial intelligence using basic math.

What got you interested in this Concept to Clinic project?

During the last semester, I attended a seminar about brain tumor segmentation using Generative Adversarial Networks. I was amazed by this kind of challenge since this has a big potential to actually help people. In my opinion, medical image segmentation clearly deviates from regular image segmentation since your target can be extremely small in comparison to your total image - you are literally searching for the needle in the haystack. At that time, I already heard of the challenge of the Data Science Bowl, but since I had no deep knowledge about biomedical image segmentation at that point, I decided not to take part in it.

A few weeks ago, I saw a post about Concept To Clinic at DataTau, which basically is the HackerNews for data scientists ;-) I was really interested in the project because it is about wrapping an algorithm that has been prototyped in an online competition in a program that should actually be used by physicians. While there already have been some competitions to create an algorithm with the highest accuracy in detecting tumors in scans of brains, lungs, livers, etc., this challenge in contrast is about a working software that should be used from end-to-end by doctors.

Do you have any advice for people who are thinking about contributing?

I'd encourage them to start with the challenge overview and user story so they immediately know what's the scope of the project, who's the user, what's the goal etc. Next, I'd suggest them to skim the issues so they might directly find something interesting. In case they run into problems or confusions, they're invited to open issues themselves and ask for help.

I'm really positively surprised by the amount of expertise that is imparted in the repository. The templates for issues and pull requests are very detailed and clean, labels and milestones are up-to-date and especially the feedback in pull requests is very helpful, detailed and friendly.

What do you hope to see from this project in the coming months?

I'd be very excited if we could actually build a piece of software that supports physicians in detecting lung nodules, that is usable (so has a nice UI and UX) and that is robust. Something really incredible would be if in the end there'd be some hospitals that actually want to deploy and use the system to help people.


Thanks to Willi for sharing! We plan to talk to more of our contributors over the coming weeks to learn about the people helping write code to fight lung cancer mortality.

Interested in contributing? There's still 6 weeks to gather points to be eligible for the first round of awards for the MVP phase of the project. Head over to the challenge site and sign up to get started!

Stay updated

Join our newsletter or follow us for the latest on our social impact projects, data science competitions and open source work.

There was a problem. Please try again.
Subscribe successful!
Protected by reCAPTCHA. The Google Privacy Policy and Terms of Service apply.

Latest posts

All posts

winners

Meet the winners of Phase 2 of the PREPARE Challenge

Learn about how winners detected cognitive decline using speech recordings and social determinants of health survey data

resources

Open-source packages for using speech data in ML

Overview of key open-source packages for extracting features from voice data to support ML applications

tutorial

Getting started with LLMs: a benchmark for the 'What's Up, Docs?' challenge

An introduction to using large language models via the benchmark to a document summarization challenge.

winners

Meet the Winners of the Goodnight Moon, Hello Early Literacy Screening Challenge

Learn about the results and winning methods from the early literacy screening challenge.

resources

Where to find a data job for a good cause

Finding data jobs for good causes can be difficult. Learn strategies, job lists, and tips to find organizations with open positions working on causes you care about.

winners

Meet the Winners of the Youth Mental Health Narratives Challenge

Learn about the winning solutions from the Youth Mental Health Challenge Automated Abstraction and Novel Variables Tracks

winners

Meet the winners of the Forecast and Final Prize Stages of the Water Supply Forecast Rodeo

Learn about the winners and winning solutions from the final stages of the Water Supply Forecast Rodeo.

insights

10 takeaways from 10 years of data science for social good

This year DrivenData celebrates our 10th birthday! We've spent the past decade working to use data science and AI for social good. Here are some lessons we've learned along the way.

tutorial

Goodnight Moon, Hello Early Literacy Screening Benchmark

In this guest post from the MIT Gabrieli Lab, we'll show you how to get started with the literacy screening challenge!

tutorial

Youth Mental Health: Automated Abstraction Benchmark

Learn how to process text narratives using open-source LLMs for the Youth Mental Health: Automated Abstraction challenge

winners

Meet the winners of Phase 1 of the PREPARE Challenge

Learn about the top datasets sourced for Phase 1 of the PREPARE Challenge.

resources

Teaching with DrivenData Competitions

Inspiration and resources for teaching students data science, machine learning, and AI skills with DrivenData competitions.

winners

Meet the winners of the Pose Bowl challenge

Learn about the top solutions submitted for the Pose Bowl: Spacecraft Detection and Pose Estimation Challenge.

winners

Meet the winners of the Water Supply Forecast Rodeo Hindcast Stage

Learn about the winning models for forecasting seasonal water supply from the first stage of the Water Supply Forecast Rodeo.

tools

Cookiecutter Data Science V2

Announcing the V2 release of Cookiecutter Data Science, the most widely adopted data science project template.

resources

How to make data science projects more open and inclusive

Key practices from the field of open science for making data science work more transparent, inclusive, and equitable.

winners

Meet the winners of the Kelp Wanted challenge

Dive into the solutions from the super segmenters who best detected kelp in Landsat imagery!

winners

Meet the winners of the SNOMED CT Entity Linking Challenge

Meet the winners with the best systems for detecting clinical terms in medical notes.

winners

Meet the winners of the Pale Blue Dot challenge

Learn about the top visuals created for the Pale Blue Dot: Visualization Challenge and the solvers behind them.

tutorial

NASA Pose Bowl - Benchmark

An introduction to the NASA Pose Bowl competition, with a benchmark solution for the object detection track

Work with us to build a better world

Learn more about how our team is bringing the transformative power of data science and AI to organizations tackling the world's biggest challenges.