blog community

Community Spotlight: Cecil Rivers, Jr.

The Community Spotlight features fantastic members from our DrivenData community. Cecil Rivers, an electrical engineer in West Hartford, USA, talks about tricks for image processing, recruiting for diversity in the sciences, and his favorite sci-fi reads.

We are starting a new initiative at DrivenData to feature some of the fantastic members in our data science community. The goal of the Community Spotlight is to bring greater visibility to the diversity of expertise, perspectives, and experiences of our community members.

In this post we sit down with Cecil Rivers, a winner of the sustainable fisheries video challenge and electrical engineer in West Hartford, USA.


Picture of Cecil

Name: Cecil Rivers, Jr.

Hometown: West Hartford, USA


To get started, tell us a little about yourself.

I have had the opportunity to call a few places home along the East Coast after graduating with a Bachelor of Science in electrical engineering. I grew up in Virginia Beach and left Virginia after graduating from Old Dominion University. I moved to Atlanta to attend Georgia Institute of Technology for my masters.

I ultimately landed a position at an electrical protection and distribution company in Connecticut where I have been for the last 20 years. I am a research and development program manager for residential and light commercial circuit breakers that contain electronic intelligence that provide very specific protection functions that cannot be provided mechanically.

Outside of work I enjoy spending time with my wife and two sons, entering data science competitions and doing international vacation travel (14 countries and counting). We do a lot of outdoorsy kind of fun stuff like hiking, exploring waterfalls and recently I bought an inflatable boat and we go out on the Farmington River a lot.

How did you get started in data science?

I always had an interest in data science, machine learning and artificial intelligence. When I was a kid, AI was the source of all the science fiction that I grew up on. My science background is in electrical engineering with an emphasis on control systems and hardware design, but several years ago, my former company was looking to grow data science practitioners, so I volunteered. To become a practitioner, I had to take several online courses offered by John Hopkins and Stanford.

By the time I completed the online courses, my former company was purchased, so I didn’t get a chance to put the practitioner designation to use, but I started to grow a toolset that allowed me to understand the fields of data science.

Data science is a broad field. What areas are you particularly interested in?

There are two areas of data science I am particularly interested in: computer vision and ethics in artificial intelligence.

The application of computer vision impacts all of our lives from self-driving to product inspection. Computer vision capabilities for devices many years ago were limited or too expensive. Now we have modern machine learning techniques like CNNs and high performance and low cost electronics bringing computer vision to doorbells and smartphones.

On an adjacent topic to computer vision, I’m also interested in the ethics of artificial intelligence. AI is impacted by cultural and racial bias just like other parts of our society. As more and more of our society has systems powered by AI infrastructure like banking, mortgage lending and criminal justice, the question that needs to be addressed is how those systems ensure that they give a fair product to every person who accesses that system. I am always interested in how companies focusing on areas like AI, which has a low number of engineers and scientist of color and women participating in the field, ensure bias and discrimination are not part of the final product when there is a lack of representation. How do we test algorithms to ensure cultural, ethnic, sexual orientations and race biases are not programmed into the AI?

AI is impacted by cultural and racial bias just like other parts of our society. As more and more of our society has systems powered by AI infrastructure like banking, mortgage lending and criminal justice, the question that needs to be addressed is how those systems ensure that they give a fair product to every person who accesses that system.

What motivated you to join a DrivenData competition?

DrivenData’s charter of using data science to assist and solve some of the biggest social challenges is what drew me into their competitions. I first started out with the "Predict Blood Donations" and then moved on from there. In some competitions I had solutions that performed fairly well and in others I had no clue about how to approach the problem, but the fact that what the participants provided could potentially make our world a better place was the driving factor.

Even though some of the competitions have a cash prize, I’m not motivated by this. There are participants that live and breathe data science as a career. I can’t compete against that. What I can do is learn new algorithms, apply them on a real problem and add new tools to my skillsets.

Is there a particular DrivenData challenge you’ve enjoyed working on?

My favorite challenges were the "N+1 fish, N+2 fish", "Hakuna Ma-data" and "Naïve Bees Classifier". They all were image related challenges. The "N+1 fish, N+2 fish" challenge allowed me to correct my failures on a previous fish related challenge I did for Kaggle. "Hakuna Ma-data" was exciting because my family and I went on safari in Tanzania several years ago, so I have firsthand knowledge on how important it is to protect and track these precious animals and knowing that technology can be used to improve this effort was fascinating.

What was the most important technical tip or trick you used to solve the problem?

When it comes to image processing and CNNs, the best NVIDIA GPU you can afford is the way to go if you plan to build your models locally.

Shout outs: Any good tools, posts, or projects from other developers that you appreciate or think the community might enjoy?

I first want to give a shout out to my wife, Vonetta and our two sons. They allow me to pursue this safe hobby versus hurting myself with saws, hammers, drills and glue. 😊 I can get in the weeds of a challenge, but they see how much I love it and that the challenges I do are helping to make a difference, so they support my hobby 100%.

I enjoy the following links and podcast related to data science:

I recently discovered KNIME which is a free tool that allows you to manipulate and perform data analysis on spreadsheets like .xls and .csv files using visual block functions. I found it useful when I want to quickly mock up a system without coding. It has a variety of data science models like neutral networks, SVM, decision trees, etc.

What hurdles have you had to overcome to become a data scientist? What advice would you give to others facing the same challenges?

I’m not a data scientist by trade, but I find the area of data science makes me a better engineer. I think the biggest challenge data scientist face is the question of expertise. If someone defines themselves as a data scientist expert, it is a bold statement because the field of study is huge. Instead of trying to be an expert in the large ocean of data science, try to find an area that excites you and then grow your expertise from there.

Data science as a field has been struggling with a well-documented lack of diversity. How have you seen diversity and representation, or the lack thereof, impact data science problems?

Lack of diversity in all sciences is truly a recruitment problem. First companies need to understand that having diverse candidates makes their company a stronger, higher performing company. There have been dozens of studies that show that this is the case.

Second, companies need to understand that diverse candidates exist. If a company only hires from schools that are predominately white, then the pool of candidates of color will be depleted very quickly. Companies need to seek candidates where there are larger talent pools instead of relying on the same universities and colleges. These talent pools include, but are not limited to, historically black colleges and universities (HBCUs), National Society of Black Engineers (NBSE), Society of Hispanic Engineers (SHPE) and Society of Women Engineers (SWE).

Companies should not rely on their employees of color to try to recruit talent. That is just being lazy. It is showing that the only way they can find people of color is to have their own employees do the work for them. Just like anything in life, if you or your company is interested in diversity, then you have to work to achieve it.

Have you read any good books or articles recently?

Octavia Butler’s “Parable of the Sower” and “Parable of the Talents” are amazing reads. They are not data science (actually they are sci-fi), but they are very relevant to the times we are in right now.

Parable of the Sower

Novel by Octavia Butler, cover art by John Jude Palencar supplied by ISFDB

Where can the community find you online?

I can be found on LinkedIn at https://www.linkedin.com/in/cecil-rivers-1459a01/


Thanks to Cecil (@kingseso) for sharing his thoughts on work, life, and data science! We are excited to feature more great community members. If you think you or someone you know would be a great addition to a future Community Spotlight, let us know!

Stay updated

Join our newsletter or follow us for the latest on our social impact projects, data science competitions and open source work.

There was a problem. Please try again.
Subscribe successful!
Protected by reCAPTCHA. The Google Privacy Policy and Terms of Service apply.

Latest posts

All posts

winners

Meet the winners of Phase 2 of the PREPARE Challenge

Learn about how winners detected cognitive decline using speech recordings and social determinants of health survey data

resources

Open-source packages for using speech data in ML

Overview of key open-source packages for extracting features from voice data to support ML applications

tutorial

Getting started with LLMs: a benchmark for the 'What's Up, Docs?' challenge

An introduction to using large language models via the benchmark to a document summarization challenge.

winners

Meet the Winners of the Goodnight Moon, Hello Early Literacy Screening Challenge

Learn about the results and winning methods from the early literacy screening challenge.

resources

Where to find a data job for a good cause

Finding data jobs for good causes can be difficult. Learn strategies, job lists, and tips to find organizations with open positions working on causes you care about.

winners

Meet the Winners of the Youth Mental Health Narratives Challenge

Learn about the winning solutions from the Youth Mental Health Challenge Automated Abstraction and Novel Variables Tracks

winners

Meet the winners of the Forecast and Final Prize Stages of the Water Supply Forecast Rodeo

Learn about the winners and winning solutions from the final stages of the Water Supply Forecast Rodeo.

insights

10 takeaways from 10 years of data science for social good

This year DrivenData celebrates our 10th birthday! We've spent the past decade working to use data science and AI for social good. Here are some lessons we've learned along the way.

tutorial

Goodnight Moon, Hello Early Literacy Screening Benchmark

In this guest post from the MIT Gabrieli Lab, we'll show you how to get started with the literacy screening challenge!

tutorial

Youth Mental Health: Automated Abstraction Benchmark

Learn how to process text narratives using open-source LLMs for the Youth Mental Health: Automated Abstraction challenge

winners

Meet the winners of Phase 1 of the PREPARE Challenge

Learn about the top datasets sourced for Phase 1 of the PREPARE Challenge.

resources

Teaching with DrivenData Competitions

Inspiration and resources for teaching students data science, machine learning, and AI skills with DrivenData competitions.

winners

Meet the winners of the Pose Bowl challenge

Learn about the top solutions submitted for the Pose Bowl: Spacecraft Detection and Pose Estimation Challenge.

winners

Meet the winners of the Water Supply Forecast Rodeo Hindcast Stage

Learn about the winning models for forecasting seasonal water supply from the first stage of the Water Supply Forecast Rodeo.

tools

Cookiecutter Data Science V2

Announcing the V2 release of Cookiecutter Data Science, the most widely adopted data science project template.

resources

How to make data science projects more open and inclusive

Key practices from the field of open science for making data science work more transparent, inclusive, and equitable.

winners

Meet the winners of the Kelp Wanted challenge

Dive into the solutions from the super segmenters who best detected kelp in Landsat imagery!

winners

Meet the winners of the SNOMED CT Entity Linking Challenge

Meet the winners with the best systems for detecting clinical terms in medical notes.

winners

Meet the winners of the Pale Blue Dot challenge

Learn about the top visuals created for the Pale Blue Dot: Visualization Challenge and the solvers behind them.

tutorial

NASA Pose Bowl - Benchmark

An introduction to the NASA Pose Bowl competition, with a benchmark solution for the object detection track

Work with us to build a better world

Learn more about how our team is bringing the transformative power of data science and AI to organizations tackling the world's biggest challenges.