Exploring CoCo Challenges in ML Engineering Teams: Insights From the Semiconductor Industry

May 8, 20262605.07389

A. Azamnouri, M. Haug, L. Woltmann, M. Fritz, J. Bogner + 1 more

cs.SEcs.LG

TLDR

This paper explores collaboration and communication challenges in ML engineering teams within the semiconductor industry, identifying 16 issues.

Key contributions

Conducted a qualitative study on ML engineering teams in a global semiconductor company.
Identified 16 recurring collaboration and communication (CoCo) challenges in hardware-centric ML.
Found "unclear roles and responsibilities" to be the most critical CoCo issue.
Presented effective practices and recommendations to mitigate CoCo problems.

Why it matters

This study fills a critical gap by analyzing collaboration challenges in hardware-centric ML engineering teams. It reveals how unique constraints amplify coordination complexity, offering insights for improving ML system deployment and maintenance. These findings are crucial for developing better tools and practices for successful ML projects.

Original Abstract

The integration of machine learning (ML) into complex software systems has increased challenges in collaboration and communication (CoCo) of the teams building these systems. ML engineering (MLE) teams often involve diverse roles, ML engineers, data scientists, software engineers, and domain experts, each bringing unique goals, experiences, and jargon. These interdisciplinary dynamics can make it challenging to deploy, reproduce, and maintain ML-enabled systems over the long term. Previous studies have uncovered several CoCo challenges and practices, but most have focused on software-centric companies, leaving limited empirical understanding of how these dynamics unfold in hardware-centric contexts. In hardware-centric environments, CoCo challenges are shaped by additional constraints such as strict data governance, long development cycles, and tight coupling with physical processes, which amplify coordination complexity and reduce flexibility. To strengthen empirical understanding in such settings, we present a qualitative investigation of MLE teams within a global semiconductor company, where ML-enabled systems and manufacturing processes introduce additional complexity. We interviewed 12 practitioners regarding CoCo practices, tools, challenges, and approaches. Through analysis, we identified 16 recurring challenges, with unclear roles and responsibilities emerging as the most critical, and common practices and recommendations practitioners considered effective in mitigating CoCo problems. While grounded in a single organizational context, our findings align with known issues in interdisciplinary ML-enabled systems development, but also demonstrate how these challenges manifest differently under hardware-driven constraints. Our results highlight directions for future research and tool support to strengthen CoCo in MLE projects and ensure the success of ML-enabled systems.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers