4. Modelling
(Project Lifecycle)
Feature Engineering | External Data
Code Review (Clean, commented, etc.)
Train and test ML Model
End-to-end Pipeline
5. Results & Evaluation
what is the performance of my model? E.g. confusion matrix
6. Visualization & Interface + Role of Data translator
task of the data translators
translate data into comprehensive insights
7. Pilot/ Testing
Questions to prepare the pilot:
What do you want to know? What are my goals with this pilot?
Who and how many are the end users? What’s the best way to provide them with the information?
How to best define the pilot groups?
How long should the pilot be?
8. Deployment & Roll-out
what is the impact of the solution rollout
how to fully integrate s in operations?
Stakeholders influences by rollout:
Comunications
People
Processes
Technology
Client/ Other Stakeholders
9. Insights & Decision-making
from raw data to impact
problem -> data -> analysis -> solution -> actions -> impact
We perform analysis, using the data, to inform the actions that will allow us to achieve the goals, thus having an impact in our problem (!!!)
10. Maintain and improve
Model monitoring: monitor …
… incoming data
users feedback
evaluate need for training
model redeployment
keep version control
So what are critical success factors?
Clear problem definition and measurable goals (what is being solved, why it matters, how success is measured).
Strong stakeholder engagement and committed project champions.
Early validation of data availability and quality.
Iterative prototyping with user feedback and pilots.
Scalability in mind — solutions should serve multiple stakeholders with flexible visualizations.
Trust and adoption — the best model is useless if decision-makers don’t use it.
Sustainability — plan for model monitoring, retraining, and long-term improvement.
Project example vaccination:
Problem: MMR vaccination rates have beendeclining dramatically
Goal: Predict which children are at risk of not receiving the MMR vaccine so that healthcare providers can proactively
intervene to promote vaccination
Project example vaccination: Lessons learned
Data Lifecycle and Ownership
Better understand the data quality and
specific features, to better plan the data curation phase (e.g., Data may be in a different language)
Only “lock” the scope after auditing the
data/ Problem definition
Change Management: Product/ prototype
showcase for Doctors buy-in
Internal buy-in: The importance of
strongly committed Partners/ Project Champions
Scale the solution: Try to serve as many stakeholders as possible with the same solution and different visualizations
Stakeholders, Problem definition and project goal for the three examples, Vaccination, umemployment, Fishing
Problem definition and context
Project Charter
1. What is the challenge you are trying to solve? What are the business processes impacted?
2. What is the opportunity we are trying to capture?
3. What are the main numbers about that challenge (e.g., How big is the problem?)?
4. Why is it a problem?
5. Has the partner tried any other solution before? If so, what failed? What were the lessons learned?
6. If there is already a solution in place, how good is it?
7. What are the consequences/ negative impacts of not solving the problem?
How to define a goal
Data science solution
Goal of the model/ system
What/ who is being targeted
Complement with industry/ area
Add relevant details
Data Science Teams:
decentralized model
Governance
Resources allocated only to projects within their silos
No view of analytics activities or priorities outside their function or business unit
Location
Analytics are scattered across the organization in different functions and business units
Project Management
Little to no coordination
Pros (+):
Early-stage Data Science
More domain focus
Cons (–):
Difficult to hire / define career path
centralized model
Stronger ownership and management of resource allocation
Project prioritization within a central pool
Analytics reside in a central group
Serve a variety of functions and business units
Work on diverse projects
Coordination by central analytic unit
Encourages career growth
Cohesive group
Chance to disconnect from business lines
Center of excellence
Better alignment of analytics initiatives and resource allocation to enterprise priorities
No direct operational involvement
Analysts are allocated to units throughout the organization
Their activities are coordinated by a central entity
Flexible model with balance of centralized and distributed coordination
Balanced
Highly coordinated
Demands more flexibility / open mind to implement a “non-traditional” model
Who is who in a data project?
Program Manager
Ensure the alignment of activities with the objectives of the specific program/ area
Guarantee that the project is aligned with the respective company stakeholders
Lead Data Scientist
• Define processes and best practices in terms of modelling
• Challenge modelling results and evaluation
• Validate projects in terms of fairness and bias
• Monitor squads regularly on the most critical issues
Lead Data Engineer
• Define processes and data architecture
Agile coach
• Promote the agile methodology and the respective best practices in the squads
• Support the application of the methodology in each project
• Unlock methodology-related topics
Data Analyst, Data Scientist, Data Engineer, UI/UX Designer, (IT Part)
Data Analyst
- Clean, refine, and understand datasets
-Collect external data
- Perform EDA
- Create follow-up reports (e.g., PowerBI)
Data Scientist
- Define data needs with engineers
- Build/train advanced models - Feature engineering
- Ensure code quality
Data Engineer
- Prepare and ingest data for models
- Build/maintain databases
- Integrate model results into systems
- Ensure productization of pipelines
UI/UX Designer
- Translate business & DS needs into interfaces
- Develop mock-ups & user flows
- Optimize end-user experience & adoption
Developer / Software Engineer
- Gather & analyze end
-user and DS requirements
- Design and develop/improve software, applications, websites
Product Owner, Busines Owner, Scrum Master
(Business Part)
Product Owner
- Define squad strategy, roadmap, and priorities
- Set sprint goals & user stories - Act as bridge with external stakeholders
Busines Owner
- Provide business/operation knowledge
- Define setup & implement in business context
- Ensure alignment with roles, systems, processes
Scrum Master
- Facilitate ceremonies (sprints, reviews, retrospectives)
- Remove obstacles & protect the squad from external noise
Last changeda day ago