Applied Science Summer Intern Assignment 2026

Assignment for candidates

This is a mandatory assignment for everyone applying for the Applied Science Internship. Please return your answers along with your application.

This assignment is exclusively for the Applied Science internship position.

Overview

Thank you for applying for Wolt's 2026 Applied Science Internship! The purpose of this assignment is to understand how you think, how you approach applied machine learning problems, and how you reason about solutions in a real product and production context.

This is not a research assignment, and we are not looking for perfect models.

We care more about:

clear problem framing
reasonable assumptions
well-justified modeling choices
awareness of limitations and production concerns

Simple, well-explained solutions are absolutely fine.

You need to submit three things:

1. Presentation (PDF format)
   - Exactly 8 slides, including the title slide
   - Must follow the mandatory presentation structure below
   - No additional slides or appendix
2. Code used in the assignment, in a reproducible and open format
   - Jupyter notebooks and/or Python scripts are fine
   - Include a README.md explaining how to run your code
   - Include a list of dependencies (for example requirements.txt)
3. Your CV, plus optional attachments describing your studies, projects, or work experience

Good communication is a key part of success in this role.

We use the presentation as the primary screening tool: based on it, we decide which submissions receive a more detailed code-level review. Based on that review, we will invite candidates to a technical interview, followed by a final interview.

Mandatory presentation structure

Your presentation must follow the structure below. We evaluate submissions slide by slide based on how clearly and thoughtfully you address these questions.

1. Title slide
   - Your name
   - Contact information
2. Problem and decision context
   - What is the problem you are solving?
   - Who uses the output of this model?
   - What decision-making or business process does it support?
   - Why is this problem relevant?
3. Data and EDA findings
   - What data did you use?
   - Key insights or findings from exploratory data analysis?
   - Anything surprising or important for modeling?
4. Feature engineering and representations
   - What signals did you create from the data?
   - What did you choose not to use, and why?
5. Modeling approach and assumptions
   - What modeling approach did you choose?
   - What assumptions does your approach make?
6. Results and evaluation
   - How did you evaluate the model?
   - What metrics did you use?
   - How good are the results for the intended use?
7. Limitations
   - Known limitations of your approach?
   - When would this break in a production setting?
8. Next steps
   - If this were deployed to production, what would you do next to improve it?

Choosing the data

We have prepared two datasets for you. Feel free to choose which data you use based on your background and ambitions. You only need to choose one.

Order flow dataset

Consider the simulated flow of orders in Helsinki over 3 months in the provided file. The dataset contains the following columns:

order_placed_at_utc: time when the order was placed in UTC
item_count: number of items in the order
order_category: reporting category of the order
actual_delivery_time_minutes: actual delivery time from order placement to completion in minutes
estimated_delivery_time_lower_minutes: lower estimate of delivery time before the order was placed
estimated_delivery_time_upper_minutes: upper estimate of delivery time before the order was placed
venue_location_h3_index: h3 geospatial index of the venue
customer_location_h3_index: h3 geospatial index of the customer
courier_supply_index: measure of available courier supply at the time of the order placement
precipitation: forecasted hourly precipitation at the time of the order placement

Tip: you can refer to the h3 documentation to investigate the h3 indexes.
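
As a first look at the order flow columns, the sketch below labels each order as early, within, or late relative to its estimated delivery interval, and extracts the hour at which the order was placed. It is a minimal illustration using only the Python standard library. The sample rows are made up for this example and are not taken from the provided file, and the timestamp format is an assumption that should be checked against the actual data.

```python
import csv
import io
from datetime import datetime, timezone

# Made-up rows for illustration only; these values are NOT from the provided file.
SAMPLE_CSV = """order_placed_at_utc,actual_delivery_time_minutes,estimated_delivery_time_lower_minutes,estimated_delivery_time_upper_minutes
2022-03-01 10:15:00,32.0,25,35
2022-03-01 17:40:00,48.5,30,40
2022-03-02 11:05:00,21.0,25,35
"""


def classify_order(row):
    """Label an order as 'early', 'within', or 'late' relative to its estimate interval."""
    actual = float(row["actual_delivery_time_minutes"])
    lower = float(row["estimated_delivery_time_lower_minutes"])
    upper = float(row["estimated_delivery_time_upper_minutes"])
    if actual < lower:
        return "early"
    if actual > upper:
        return "late"
    return "within"


def parse_utc(text, fmt="%Y-%m-%d %H:%M:%S"):
    """Parse a timestamp string as timezone-aware UTC (the format is an assumption)."""
    return datetime.strptime(text, fmt).replace(tzinfo=timezone.utc)


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
labels = [classify_order(r) for r in rows]
hours = [parse_utc(r["order_placed_at_utc"]).hour for r in rows]
print(list(zip(hours, labels)))
```

Note that the timestamps are in UTC, so any hour-of-day feature may need converting to local Helsinki time before it is interpreted.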

Item sales dataset

Consider the daily item sales history in the provided file. The dataset represents simulated daily grocery sales from a number of venues in Finland, and contains the following columns:

venue_id: venue identifier
sku_id: SKU (Stock Keeping Unit), i.e. internal product and variant identifier
phl1_id, phl2_id, phl3_id: PHL (Product Hierarchy Level) identifiers. Each SKU belongs to a PHL3, which in turn belongs to a PHL2, which belongs to a PHL1. From PHL3 up to PHL1, the product categories become increasingly general.
country_id: venue country identifier
price: unit price in EUR
promo_flag: whether a promotion is active for the given SKU on a given date
promo_depth: depth (as a percentage) of the promotion
operating_minutes: operating minutes of the venue on the given date
in_stock_minutes: how many minutes the SKU was in stock in the given venue on the given date
stockout_flag: a binary flag indicating whether the SKU ran out of stock in the given venue on the given date
units_sold: units of the SKU sold in the given venue on the given date
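
The availability columns can be combined into a simple derived signal. The sketch below computes the share of operating time during which a SKU was in stock and, from it, a rough availability-adjusted sales figure. Both helper functions are hypothetical names introduced for this example. The adjustment rests on the assumption that demand is spread evenly over operating hours, which is a simplification you may want to challenge in your own analysis.

```python
def in_stock_share(in_stock_minutes, operating_minutes):
    """Fraction of the venue's operating time during which the SKU was in stock."""
    if operating_minutes <= 0:
        return None  # venue closed: availability is undefined
    return min(in_stock_minutes / operating_minutes, 1.0)


def availability_adjusted_units(units_sold, in_stock_minutes, operating_minutes):
    """Scale observed sales up to a hypothetical full-availability day.

    Assumes demand is spread evenly over operating hours (a simplification).
    """
    share = in_stock_share(in_stock_minutes, operating_minutes)
    if not share:  # None or 0.0: nothing can be inferred from this day
        return None
    return units_sold / share


# Example: 6 units sold while in stock for half of a 600-minute day.
print(in_stock_share(300, 600))
print(availability_adjusted_units(6, 300, 600))
```

Whether to train on observed units_sold or on an adjusted figure like this is a modeling decision worth stating explicitly among your assumptions.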

Choosing a modeling approach

Using your chosen dataset, define a modeling task that is relevant to Wolt.

To give you an idea of what we are looking for, the task might look something like one of these:

Can we estimate the delivery time of an order?
Where will orders be delivered in the near future?
Based on past data, can we forecast item sales for tomorrow, next week, or later?

Your task must result in some form of predictive model. You may train one or multiple models.

Your modeling approach should follow naturally from the decision context you define. Model choice without a clear link to how the output is used will be evaluated poorly.

A simple model with strong reasoning is preferred over a complex model with weak justification.
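
To illustrate what a simple model can look like, here is a sketch of a seasonal naive forecast, which repeats the most recent week of observations. This is only one possible baseline, not a required or recommended approach. The function name and the weekly season length are assumptions made for the example, and the history values are made up.

```python
def seasonal_naive_forecast(history, horizon, season_length=7):
    """Forecast by repeating the last full season of observations."""
    if len(history) < season_length:
        raise ValueError("need at least one full season of history")
    last_season = history[-season_length:]
    return [last_season[i % season_length] for i in range(horizon)]


# Made-up daily unit sales for two weeks (illustration only).
history = [3, 5, 4, 6, 8, 12, 10, 4, 5, 5, 7, 9, 13, 11]
forecast = seasonal_naive_forecast(history, horizon=3)
print(forecast)
```

A baseline like this also gives more complex models a reference point: if they cannot beat it on held-out data, the added complexity is hard to justify.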

Working with the data

Exploration

Produce meaningful statistics and visualizations
Focus on insights that influence modeling choices or decisions
Exhaustive analysis is not required

Feature engineering

Explain what features you created and why
Explicitly discuss what you chose not to use

Modeling

Describe your approach and its benefits
Clearly state assumptions
Explain why this approach makes sense for the problem

Evaluation

Describe how you evaluated the model
Discuss what kinds of errors matter most
Consider how useful the model would be in practice
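
The evaluation points above can be sketched in a few lines. The example below shows a chronological train/test split, mean absolute error, and a pinball (quantile) loss that penalizes under-prediction more heavily than over-prediction. All function names are hypothetical, and the choice of a 0.9 quantile is an assumption for illustration: which direction of error matters more depends on the decision context you define.

```python
def chronological_split(rows, test_fraction=0.2):
    """Split time-ordered rows so the test set is the most recent part."""
    cut = len(rows) - int(len(rows) * test_fraction)
    return rows[:cut], rows[cut:]


def mean_absolute_error(actual, predicted):
    """Average absolute difference between actual and predicted values."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def pinball_loss(actual, predicted, quantile=0.9):
    """Quantile loss: each unit of under-prediction costs `quantile`,
    each unit of over-prediction costs `1 - quantile`."""
    total = 0.0
    for a, p in zip(actual, predicted):
        diff = a - p
        total += quantile * diff if diff >= 0 else (quantile - 1) * diff
    return total / len(actual)


# Made-up delivery times in minutes (illustration only).
actual = [30, 40]
predicted = [35, 30]
train, test = chronological_split(list(range(10)))
print(mean_absolute_error(actual, predicted))
print(pinball_loss(actual, predicted))
```

Splitting by time rather than at random avoids evaluating the model on data that precedes its training period, which would not match how it is used in production.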

Limitations

Discuss known limitations and failure modes

Submitting the assignment

Bundle everything into a ZIP archive and upload it to Google Drive, Dropbox, or a similar service. Include the download link in your application.

Important notes

Do not store your solution in a public GitHub repository
Do not share your solution publicly in any form
Make sure file permissions allow us to access the materials

A good check before sending your submission is to unzip the ZIP archive into a new folder and verify that building and running the project works, using the steps you define in README.md. Even the best of us sometimes forget dependencies or instructions. If we cannot access or run your submission, we unfortunately cannot review it.

Files in the repository:

README.md
grocery_sales_autumn_2025.csv
orders_spring_2022.csv
