- Karina Datascientist's Newsletter
- Posts
- The Data Analytics Project Workflow
The Data Analytics Project Workflow
You've learned SQL. You've mastered Excel. You can build dashboards.
But then you get your first real project at work, and suddenly you're staring at your screen thinking: "Where do I even start?"
Today, let's break down the actual workflow of a data analytics project.
Step 1: Understand the Business Question (Before You Touch Any Data)
This is where most analysts go wrong. They jump straight into the data.
Don't.
First, talk to your stakeholders:
What decision are they trying to make?
Why do they need this analysis now?
What will they do with the results?
Spend some time to gain clarity. It will save you hours later.
Step 2: Define Success / Definition of done
Ask: "What does a good answer look like?"
Is it a single number? A trend over time? A comparison between groups? A recommendation?
Get alignment on deliverables before you write a single query. What data are you expected to present and what is the format for deliverables (simple email, PowerPoint etc.)?
Step 3: Find and Assess Your Data
Now you can look at the data. Ask yourself:
Do we have the data needed to answer this question?
How clean is it?
What's the date range available?
Are there any obvious gaps or issues?
Sometimes you'll discover the data doesn't exist or it is archived and you have to request data recovery (my pain from this week). Better to know this on Day 1, not Day 5.
Step 4: Clean and Prepare
The unglamorous part. But necessary.
Standardize formats
Remove duplicates
Create any calculated fields you need
This usually takes 50-70% of your project time. Yes, really.
Step 5: Analyse (Finally!)
Now you can actually answer the business question.
Start simple:
Basic aggregations
Trends over time
Comparisons between groups
Then go deeper if needed.
But always tie it back to the original business question.
Step 6: Validate Your Results
Sanity check everything:
Do the numbers make sense?
Do they align with what the business expects?
If not, why not? (Sometimes unexpected results are the most valuable)
Run your analysis by a colleague if possible. Fresh eyes catch mistakes.
Step 7: Create Your Story
Data without context is just numbers.
Your deliverable should answer:
What did you find?
Why does it matter?
What should we do about it?
Choose the right format:
Dashboard for ongoing monitoring
PowerPoint for one-time strategic decisions
Written report for documentation
Remember: your slides are highlights, not a data dump.
Step 8: Present and Discuss
This is where your communication skills matter.
Start with the conclusion (executive summary). Sometimes this is what sparks a conversation and you don’t even move past the first slide
Show the key insights
Be ready to go deeper on what interests them
Step 9: Document and Handover
Future you will thank you.
Document:
Where you got the data
Any assumptions you made
Any cleaning tricky steps you made (removed duplicates, split data, filtered out specific dates)
How to refresh/update the analysis
Known limitations
If someone asks about this in 6 months, you should be able to recreate it. No need to be ‘War and Peace‘ length, it can be several bullet points.
The Real Workflow (Let's Be Honest)
In reality, it's never this linear:
You'll discover data issues in Step 5 and go back to Step 3
Stakeholders will change their mind halfway through
You'll realise you answered the wrong question and start over
The data will be messier than you thought
That's normal. That's the job.
Common Mistakes to Avoid
Starting with the data before understanding the question
Spending too much time making things perfect before getting feedback. Done is better than perfect
Not documenting as you go (and hating yourself later)
Forgetting to validate results with business logic
Creating 50-slide presentations when 10 would do
Keep pushing 💪
Karina
Python tip
New way to merge dictionaries in Python 3.9+
dict1 = {'a': 1, 'b': 2}
dict2 = {'c': 3, 'd': 4}
# Old way
merged = {**dict1, **dict2}
# New way
merged = dict1 | dict2
print(merged)Grab your freebies if you haven’t done already:
Data Playbook (CV template, Books on Data Analytics and Data Science, Examples of portfolio projects)
Need more help?
Just starting with Python? Wondering if programming is for you?
Master key data analysis tasks like cleaning, filtering, pivot and grouping data using Pandas, and learn how to present your insights visually with Matplotlib with ‘Data Analysis with Python’ masterclass.
Grab your Pandas CheatSheet here. Everything you need to know about Pandas - from file operations to visualisations in one place.
![]() | Data Analyst & Data Scientist |
