1. Introduction
Nowadays, street delivery has become a significant challenge because many road infrastructures were built decades ago and were not designed to address the current volume of vehicles. For instance, when highway bridges were constructed, the heavy daily traffic of trucks that we see today was not expected [
1]. This has led to congestion, wear and tear on infrastructure, and inefficiencies in delivery systems, particularly during the last mile of delivery [
2].
Last-mile delivery refers to the final stage of the supply chain, where goods are transported from a distribution center to the end customer. This stage is critical, as it often constitutes the highest cost and most complex part of the logistics process [
3]. This shift in delivery methods introduces new constraints and opportunities, particularly in optimizing operations to accommodate the unique capabilities of drones. Therefore, addressing the size and weight limitations of drone-based deliveries is necessary, with a focus on lightweight parcels typically under 5 kg, which are ideal for drone transport, including small consumer goods, documents, and e-commerce packages [
4].
Integrating drones into the delivery system presents a promising solution to address the delivery challenges. Drones can bypass crowded roads and directly reach customers, reducing the demand on existing road infrastructure and improving delivery speed and efficiency. This innovative approach not only mitigates the limitations of outdated roadways but also offers a more flexible and scalable solution for handling the growing demand for fast and reliable deliveries in urban and rural areas correspondingly [
5].
This multimodal delivery system inherently adds complexity to the overall process. Optimizing such a system is challenging, particularly when it comes to tasks like determining the optimal locations for hubs, route optimization, or efficiently scheduling deliveries. By focusing on these optimization tasks, logistics managers can significantly improve the performance of multimodal delivery systems, ensuring that resources are used effectively, costs are minimized, and customer satisfaction is maximized. The complexity and interconnected nature of multimodal logistics necessitate the use of advanced optimization techniques and technologies, such as mathematical modeling, simulation, and data analytics, to achieve these goals.
These problems lie at the core of operational optimization and are heavily dependent on data-driven approaches [
6]. However, collecting the necessary data from real-world sources is often costly and may not fully capture the entire range of scenarios or conditions within the domain of interest [
7]. Additionally, access to these data is largely restricted due to privacy regulations and limitations imposed by company policies. This is where synthetic data can play an important role in a multimodal delivery system. By generating synthetic data, it is possible to simulate a wide variety of scenarios, allowing for comprehensive testing and optimization of the delivery processes without the problems associated with gathering real-world data [
8]. Synthetic data can help fill in the gaps, providing a broader and more diverse dataset that enhances the ability to fine-tune the system for better efficiency and effectiveness in a multimodal delivery environment.
In this paper, we introduce a synthetic dataset generator designed through a combination of static environmental generation and dynamic data modeling within the broader delivery system. This system accounts for a wide array of variables, including the locations of customers and delivery vehicles, such as trucks and drones. Each participant in the system is characterized by its own set of properties, including environmental impact and economic costs. Our approach specifically targets the requirements of various optimization tasks within the overall delivery process, such as optimal locating hubs or job shop scheduling. Additionally, our synthetic dataset addresses combinations of optimization problems. Our key contributions can be summarized as follows:
- 1.
Defining multimodal logistics scenarios and their key components, along with the corresponding parameters, for our synthetic dataset generator. This includes the integration of mobile micro hubs and the consideration of environmental factors such as wind conditions, communication quality, and others, which play a crucial role in the functionality and optimization of the system.
- 2.
Introducing a dataset generator capable of producing a vast array of optimization problems. Additionally, our generator enables the creation of datasets that integrate multiple optimization challenges, such as combining the Vehicle Routing Problem with path planning, thereby supporting complex and interconnected problem-solving scenarios.
- 3.
Expanding scalability, compatibility, and benchmarking capabilities of our dataset generator, which produces real-time routes based on user-defined longitude and latitude ranges. By accounting for diverse traffic constraints across different regions of the world, our generator serves as a universal scenario generator, enabling the creation of globally applicable datasets for optimization problems.
- 4.
Evaluate the generated dataset [
9] by applying it to various optimization problems, including facility location, vehicle routing, and path planning, using different techniques to demonstrate the dataset’s effectiveness and operational viability.
This paper is organized as follows: In the next section, we provide a comprehensive review of the existing literature in this field, highlighting key studies and identifying gaps that our work aims to address. In the
Section 2, we present several optimization problems that rely on data for optimization and testing. Following this, we outline our methodology and detail the development of our synthetic dataset generation algorithm, explaining the various steps involved in its construction. We then present our findings, showcasing the results and conducting a statistical evaluation of the dataset to assess its validity and applicability. Finally, we conclude the paper by summarizing our contributions and discussing potential avenues for future research.
Notation
Here, we establish the mathematical notation used throughout the paper to facilitate clear communication of the proposed approach. Vectors are denoted by bold symbols, such as , and when indexed for a specific purpose, by . The i-th entry of a vector is represented as , while refers to the vector corresponding to the k-th modality. Similarly, matrices are denoted by bold uppercase symbols, such as , with matrix indices represented by . The -th entry of a matrix is written as , and represents the matrix associated with the k-th modality. The transpose of a matrix is indicated by . The cardinality (or size) of a set is denoted as , and gradients with respect to vectors are written as . Sets are generally represented using calligraphic letters, such as , except for , which specifically denotes the loss or cost function in optimization problems. The expectation operator is symbolized by , and the function f parameterized by is written as . For clarity, we recommend referring back to this section as needed while following the technical details presented in later parts of the paper.
2. Related Work
This section reviews related research on synthetic map generation and synthetic logistical data generation, which are both integral components of the broader synthetic multimodal delivery systems data. Here, we explore how synthetic logistical data are generated to model diverse supply chain scenarios, and how synthetic maps are created to simulate various geographical and environmental conditions.
2.1. Synthetic Logistic Data Generation
The generation of synthetic customer data in logistics is an important field, driven by the need to simulate various scenarios and optimize logistics operations without relying on costly and insufficient real-world data. Financial services for example generate complex datasets due to regulatory and business constraints, limiting data sharing. Assefa et al. highlights the need for effective synthetic data generation that maintains the properties of real data while ensuring privacy [
10]. The paper of Merkuryeva et al. [
11] presents a methodology and simulation environment for optimizing multi-echelon supply chains in batch and semi-batch industries. The environment supports data management, network construction, and policy optimization, with a business case illustrating its practical application [
11].
Many real-world problems can be effectively modeled using the well-known traveling salesman problem, a topic that has been extensively studied in the literature. Numerous instances of this problem have been explored, highlighting its versatility and complexity [
12]. Kanda et al. [
12] generated a synthetic dataset and proposed a meta-learning-based approach to optimize the selection of algorithms specifically designed for traveling salesman instances. Their work emphasizes the importance of algorithm selection in achieving optimal results. Similarly, He and Xiang [
13] analyzed various traveling salesman algorithms using synthetic datasets, providing a comprehensive comparison by also testing their approaches on real-world datasets of different sizes. This dual approach allowed them to assess the robustness and scalability of the algorithms in practical scenarios. Building on these studies, Jozefowiez et al. [
14] introduced and tested an innovative approach for the bi-objective TSP with profits. Their research focused on balancing the conflicting objectives of minimizing tour length and maximizing profits, demonstrating the approach’s effectiveness on synthetic data by benchmarking it against leading single-objective methods. Salman et al. extend the classical traveling salesman problem to the precedence-constrained generalized traveling salesman problem that also includes the sequential ordering problem. The authors tested their branch-and-bound algorithm on a set of 23 synthetic instances and five industrial case studies [
15].
In this paper, we introduce a versatile dataset generator for benchmarking optimization problems, with a particular focus on complex multimodal logistics scenarios. Unlike previous approaches that are limited to generating data for the traveling salesman problem or other specific applications, our generator can be used for many optimization problems or combinations of optimization problems. We define the key components and parameters essential for multimodal logistics. Additionally, we enhance the scalability and realistic global applicability of the generator by enabling dataset creation based on user-defined geographic ranges, while considering regional traffic and constraints. Our work offers a comprehensive and adaptable solution for optimization research and benchmarking in multimodal logistics.
2.2. Synthetic Map Generation
In many studies on multimodal logistics, researchers often neglect the impact of environmental factors in their optimization processes [
16,
17,
18]. However, diverse environmental conditions such as terrain characteristics, safety restrictions, communication network coverage, and more play a crucial role in real-world multimodal logistics challenges. Incorporating these factors into optimization strategies is essential to develop solutions that are both practical and effective across varying scenarios.
In logistics, weather conditions play a critical role in planning and optimizing delivery operations, especially in scenarios involving last-mile delivery and multimodal logistics systems [
5]. Weather data significantly impact routing, vehicle selection, and delivery scheduling. For example, adverse weather conditions such as heavy rain or snow can delay deliveries, increase transportation costs, and necessitate alternative routes or modes of transport. Studies like those by [
19,
20] highlight the importance of integrating weather variables into logistics optimization but note the lack of readily available datasets for such tasks. This gap has motivated the use of synthetic data generation tools. Therefore, generating synthetic weather data has emerged as a solution for simulating diverse weather conditions to test and improve delivery strategies under controlled realistic conditions [
21]. Several approaches to synthetic weather data generation have been proposed, leveraging statistical models, machine learning, and physical simulations. Statistical models like WGEN and LARS-WG use observed climate patterns to simulate variables like temperature and precipitation under diverse conditions. Such tools can be leveraged to simulate delivery challenges under varying weather impacts [
22]. Studies like [
23] have applied generative adversarial networks to create diverse weather scenarios for urban delivery testing, enabling experimentation across rare and extreme weather conditions that are otherwise difficult to observe in real datasets. Synthetic weather data are particularly valuable in the context of delivery drones and autonomous vehicles. Research by [
24] demonstrates how synthetic weather scenarios are used to test UAV navigation systems under varying visibility and wind conditions. Another example is the use of synthetic data for multimodal logistics, where weather simulations help evaluate the robustness of route optimization algorithms across different transportation modes.
Another aspect of map generation is considering the terrain which plays a crucial role in multimodal logistics because it directly impacts the efficiency, cost, and feasibility of transporting goods across different regions. Terrain generation is commonly used in game design, especially in open-world and sandbox games where expansive, randomly generated environments are essential [
25]. It has been effectively combined with techniques like Voronoi diagrams to enhance the detail and diversity of these virtual worlds [
26]. Procedural methods are key components for generating terrains and continents that have proven successful, as demonstrated in previous studies [
27,
28]. Some of the techniques used in the procedural generation of terrains and continents include Perlin noise [
29], L-systems [
30] and fractals [
31]. Those techniques can be used alone, combined with each other, or combined with other techniques such as erosion simulation [
32]. The procedural generation can also be parametrically controlled to avoid completely random content. The parametrization can be achieved in different ways such as adjusting division limits [
33].
This highlights the increasing importance of synthetic weather data in enhancing delivery logistics, especially for testing robust scenarios and developing resilient optimization strategies. Rather than focusing on creating terrain or weather data, our approach models factors such as network quality, safety areas, and other critical elements within logistics scenarios. By simulating environmental influences like traffic density, connectivity, and potential hazards, we reflect real-world conditions that can impact logistical operations. These factors not only provide a more accurate representation of logistical challenges but also affect a variety of optimization problems.
3. Multimodal Logistic Optimization Problems
In this section, we outline four potential optimization challenges within multimodal delivery systems and demonstrate how our approach effectively addresses these issues, providing a framework for testing various algorithms. After gathering all multimodal logistic optimization problems, we can identify the essential parameters and measurements within the dataset. This ensures that our synthetic dataset serves as a versatile benchmark, suitable for evaluating a wide range of optimization problems. Additionally, its flexibility allows researchers and practitioners to adapt it to specific scenarios, making it a valuable tool for performance comparison and algorithm validation.
3.1. Route Optimization
Route optimization focuses on determining the most efficient path for vehicles to take [
34]. This task involves minimizing the distance or time traveled, directly influencing fuel consumption, operational costs, and delivery times [
35,
36]. A common approach is Shortest Path Routing, which uses algorithms like Dijkstra’s or A* to compute the quickest route between two points on a map [
37]. However, in real-world logistics, conditions such as traffic congestion, road closures, or environmental hazards may change frequently, necessitating dynamic routing [
38]. Dynamic routing algorithms adjust the vehicle’s route in real-time, often incorporating live traffic data, weather information, and vehicle location tracking. This ensures punctual deliveries and helps reduce fuel costs and emissions by avoiding traffic jams or less optimal routes. By integrating both static and dynamic approaches, logistics operators can efficiently manage their fleets across diverse areas [
39].
In addition to the traditional route optimization for conventional trucks, there are also route optimizations for other multimodal vehicles [
40]. Drones play a crucial role in this context, as they often cannot take a direct path to their target. Certain restrictions apply, such as avoiding specific areas or crossing streets only at orthogonal angles. Additionally, drones may need to bypass regions with poor communication network coverage. Another important factor in drone routing is the consideration of safety zones, which must be strictly regulated as well [
5]. Furthermore, it is essential to account for the combination of different transportation modalities as presented in
Figure 1. For instance, a truck may deliver goods to a designated station, from which a drone then carries the package to the final destination. This hybrid approach requires careful coordination between different vehicle types to ensure efficiency across the entire route.
To effectively model real-world routing scenarios, a synthetic dataset generator must be capable of producing diverse, realistic datasets that capture the dynamic nature of transportation environments. Specifically, the dataset should include a variety of real-time routes, while also simulating essential environmental factors such as communication network quality, fluctuating traffic densities, and changing weather conditions. These features are necessary for enabling comprehensive analysis of multimodal routing strategies under various conditions. Additionally, to support machine learning applications, the dataset generator should be able to produce large volumes of data. This capability would allow researchers to train algorithms that can predict optimal routing choices, detect traffic pattern trends, and improve overall logistics efficiency.
3.2. Scheduling Optimization
Scheduling optimization in logistics revolves around the efficient allocation of resources such as machines, vehicles, and cargo, ensuring that schedules are made on time and at the lowest possible cost [
42]. Scheduling aims to optimize when deliveries are made to meet customer requirements, such as delivery windows, while minimizing the total travel time and ensuring vehicle availability. Proper scheduling also considers external factors like warehouse operating hours or time-sensitive deliveries for specific goods. Vehicle scheduling involves planning vehicle usage, and ensuring that the fleet is deployed in the most efficient way possible as presented in
Figure 2. For example, multi-stop deliveries might require scheduling vehicles so that routes are as short as possible while meeting all customer requirements. This becomes even more challenging in multimodal logistics, where different transportation modes must be coordinated. Effective scheduling reduces downtime, ensures optimal fleet utilization, and can reduce operational costs [
43].
Additionally, scheduling in multimodal logistic networks, such as those involving both drones and trucks, requires careful resource allocation and coordination between different transport modes. This involves not only determining which vehicle or modality to use for each leg of delivery but also ensuring that all necessary resources such as vehicles, fuel or battery levels, and cargo space are available at the right time. For instance, a truck may need to deliver a package to a transfer point before a drone can take over for the last mile. In such cases, optimizing the timing between modes becomes critical to avoid delays at transfer points and minimize idle time for either vehicle. Moreover, scheduling must account for specific constraints unique to each modality. For example, drone flight schedules might be restricted by battery life, charging times, weather conditions, or airspace regulations, while trucks face different constraints such as traffic conditions, road infrastructure, and driver hours-of-service regulations. Coordinating these constraints across modalities ensures seamless transitions and reduces potential bottlenecks in the logistics network. Furthermore, the integration of real-time data—such as traffic updates, weather forecasts, and live tracking of vehicle status—can enhance the scheduling process, allowing dynamic adjustments to be made as conditions change. This flexibility is crucial in multimodal logistics, where unexpected disruptions can have a ripple effect across the entire delivery chain. By continuously optimizing the schedule in response to real-time conditions, logistics operations can maintain efficiency, meet delivery windows, and reduce overall operational costs [
43].
For effective scheduling optimization, a synthetic dataset generator needs to produce a continuous stream of realistic scenarios that incorporate a range of operational constraints and variables. Such a dataset should enable the simulation of various scheduling strategies, allowing for a detailed evaluation of their performance across diverse conditions. By generating data that reflect these complexities, the synthetic dataset can support logistics companies in refining their scheduling processes, ensuring that strategies remain responsive to customer demands while optimizing resource allocation.
3.3. Network Design and Configuration
Network design and configuration are critical for establishing efficient logistics infrastructure. In multimodal logistics networks, apart from the warehouses, intermediate hubs play a crucial role in connecting different transport modes or serving as waypoints for vehicle operations. These hubs can be designed exclusively for road vehicles, for drones, or as hybrid hubs serving both. Depending on the type, the hubs will have distinct characteristics and optimization goals. For instance, the farther a drone’s target is, the more hubs may be required for battery recharging or package transfers over longer distances. Additionally, trucks aim to minimize economic and environmental costs, optimizing for fewer stops and efficient route planning. One example of a micro hub is presented in
Figure 3.
In multimodal drone logistics, there are three primary types of micro hubs. Storage hubs function as temporary storage locations, improving inventory management and enabling the swift dispatch of goods [
44]. This kind of hub can be used in an unimodal as well as in a multimodal manner. Charging hubs allow vehicles run by the battery to recharge, thereby extending their range for long-distance deliveries [
45]. Modality change hubs facilitate the seamless transfer of goods between various transportation modes, enhancing the overall efficiency of the logistics network [
46]. These hubs can be designed in various sizes to suit specific environmental and logistical needs. The best example of this is the transition from a truck to various drones for last-mile delivery.
The facility location optimization shown in
Figure 4 involves identifying the best locations for distribution centers or transfer hubs, which serve as critical points in the logistics network where goods are sorted and routed to their destinations. Placing hubs in optimal locations can significantly reduce transportation costs and improve service levels [
47]. A well-optimized network topology minimizes the total cost of goods movement while maintaining service reliability and flexibility. It is essential to consider various constraints like transportation capacity, demand, and service time windows [
48]. The placement of the micro hub, whether by selecting a position from a predefined set of discrete locations or by allowing maximum flexibility through continuous positioning, is shaped by a range of factors. As a central information hub, it requires reliable communication quality and should be located in easily accessible areas, preferably with low traffic. Additionally, weather conditions play a significant role, particularly for drones, even if they have less impact on trucks. Some hubs are mobile, meaning they can transport drones to specific locations, from which the drones can then swarm to their targets. These mobile hubs must also have sufficient drone capacity. Overall, the capacities of drones, hubs, and trucks must be optimized to meet demand efficiently.
To accurately optimize logistics scenarios and estimate the associated optimization costs, a synthetic dataset must provide data that supports realistic modeling of complex logistics processes. In particular, the dataset should be robust enough to enable testing within advanced frameworks, such as reinforcement learning, to identify optimal logistics solutions. In this work, as described in
Section 5, we demonstrate a proof of concept by leveraging reinforcement learning to determine the optimal placement of a micro hub, guided by cost considerations.
3.4. Environmental Impact Reduction
All optimization problems can be adapted to multi-objective optimization by incorporating environmental impact as an additional objective to minimize. Environmental impact reduction is an essential aspect of modern logistics, as companies strive to lower their carbon footprint. Sustainable routing focuses on finding routes that minimize fuel consumption and carbon emissions. This involves using alternative routes with less traffic or flatter terrains, which result in less fuel usage [
49]. Additionally, integrating electric or hybrid vehicles into the fleet can significantly reduce emissions, although this requires optimization in terms of charging and maintenance schedules. Eco-friendly vehicle utilization involves selecting the right vehicles based on their efficiency, load capacity, and emission levels. It also means ensuring that vehicles are not idling unnecessarily, which leads to wasted fuel and higher emissions [
50]. Optimization models help logistics companies make decisions that balance environmental impact with cost-effectiveness, such as determining the optimal number of trips or selecting modes of transport that emit less CO
2 [
51].
To effectively incorporate environmental considerations into logistics optimization, a synthetic dataset must support the customization of environmental cost factors associated with each vehicle. This capability would enable the adjustment of variables like fuel consumption, emissions, and energy usage, allowing for a detailed modeling of the environmental impact across different logistics scenarios.
4. Synthetic Dataset Generator
In this section, we first define the problem associated with creating a multimodal synthetic logistics dataset. After outlining the problem, we introduce our methods for generating this dataset. We begin by detailing the necessary parameters and the general framework of our synthetic dataset generator. Next, we describe the approach for creating the maps. Finally, we explain the procedures for generating the individual elements involved in the logistics network. Combining each component results in the overall dataset.
4.1. Problem Definition
Let us consider a multimodal logistics network described by a tuple , where is a directed graph representing the network. Here, is the set of nodes (e.g., warehouses, trans-shipment points) and is the set of edges that represent possible transportation links. is the set of available transportation modes (e.g., road, rail, air, water), where each mode is characterized by specific properties such as capacity and costs. is the set of routes, where each route is a sequence of edges realized by choosing a mode for each edge. The problem is to generate a synthetic dataset that simulates the behavior of the logistics network under different scenarios.
Let
be the cost function that describes the total cost of an order
s using mode
m along route
r. The synthetic dataset
is intended to be a collection of scenarios
, where each
is a realization of the random variables
according to the probability distribution
. The aim is to generate
such that it approximates the properties of the real network sufficiently well, i.e., it should hold the following:
Here,
stands for the expected value under the distribution
, and
for the expected value based on real data.
4.2. Data Collection and Parameters
In this section, we present a comprehensive set of parameters related to the previously presented optimization problems, organized into four main categories: clients, orders, warehouses, and vehicles.
Table 1 outlines the classification of various parameters, forming the foundation for our proposed synthetic dataset generator.
In the following section, we provide an overview of the framework for our multimodal logistics scenario, detailing the overall structure and explaining the generation process for each parameter listed in the table. We will walk through the methods and logic used to produce realistic and varied datasets, capturing the complexities and interactions essential for a multimodal logistic dataset.
4.3. General Framework
The synthetic dataset generator consists of several stages in which different components of the logistics network are randomly generated. These include the creation of environmental constraints and limitations (e.g., communication quality), the definition of nodes (e.g., warehouses, micro hubs, customers), and the definition of vehicles (e.g., trucks, drones). Each of these components is generated using probabilistic models and mathematical distributions, ensuring a diverse range of scenarios that reflect real-world complexities. For example, the network environments are created by simulating various geographical features and layouts, while the nodes are characterized by parameters such as capacity, demand, and position, which can influence the efficiency of the logistics network. In
Figure 5, we show all the components of the multimodal delivery framework.
Once all relevant components have been generated, they are integrated to form a comprehensive synthetic dataset, capturing the intricate relationships and interactions within the logistics system. The calculation of the edges, which includes traffic data and potential routes, is performed to further enhance the dataset’s realism. This synthesized dataset enables a benchmark for researchers to evaluate different strategies and operational decisions. Additionally, it serves as an analytical environment for various applications, such as reinforcement learning, allowing for the testing and development of algorithms that can optimize logistics operations under varying conditions and constraints.
The parameters that are not of interest can be deactivated, ensuring they remain constant throughout the data generation process. For example, environmental constraints like network quality, weather conditions, or traffic density can be kept stable across all iterations. Meanwhile, variables such as customer locations and demand can vary with each iteration, adding flexibility to the generation process and enabling dynamic scenario adjustments. These static elements are crucial as they define the fundamental constraints of the multimodal delivery system, establishing a stable framework that persists throughout the data generation process. In our dataset generator, we can also choose to exclude certain non-essential data to save memory and improve processing speed for the underlying task. By selectively skipping optional data, we reduce the computational load and optimize resource usage, which is particularly useful when dealing with large datasets or real-time processing. This flexibility allows us to focus on the most relevant parameters for the task at hand, ensuring efficient data handling without the need for following data processing.
4.4. Design of Environmental Conditions
In order to optimize a multimodal logistics network, information about the position of nodes and edges alone is not enough. Rather, additional data are required that reflect the current situational conditions. The communication quality influences the control and data exchange between vehicles. Low communication quality leads to delays or failures, which is particularly problematic for the real-time navigation of drones. Traffic density directly impacts the speed and route selection of trucks. High densities cause congestion and force route changes that affect the entire network. Wind directions and speeds are crucial for drones as they affect stability and energy consumption. These conditions should also be considered when it comes to the positioning of micro hubs, as these represent a central landing point for different modes of transportation.
Collecting real-world data for complex scenarios, particularly in multimodal logistics where variables like communication quality, traffic density, and environmental influences interact, is challenging. This difficulty arises because real-world logistics data are often lacking and costly. Furthermore, environmental conditions vary unpredictably, making it hard to capture these factors consistently. Synthetic data generation becomes essential under these constraints, as it allows researchers to build controlled, customizable scenarios that are rare or impractical to capture in the real world. Nevertheless, generating realistic environmental conditions for a multimodal synthetic scenario poses a variety of challenges. These variables are highly dependent on local gradients and stochastic variations and cannot simply be represented by static values. Instead, it is necessary to capture realistic variability and dynamics, such as gradient noise and gradient fields.
To simulate these environmental conditions, we employ two distinct approaches. First, we utilize gradient noise to model continuous background variables such as traffic density, communication quality, and forbidden zones. Additionally, we generate gradient vectors to represent more variable elements like wind and other weather conditions.
4.4.1. Gradient Noise for Stochastic Variability
We use a Perlin noise pattern to model gradient noise maps, often used in computer graphics to create natural-looking textures. The main function generates a Perlin noise pattern based on a defined grid of gradients with cells with integer coordinates
. This grid is scaled and wrapped to fit within the unit interval
:
At each grid point
, we assign a gradient vector
defined by a random angle
:
where
is uniformly distributed over
. The gradient vector
describes the directional influence of the point
on nearby locations within the cell.
For a given point
within a cell
, we compute the distance vectors from the four nearest grid points (corners of the cell):
These vectors represent the relative positions within the cell with respect to each of the four corners. To determine the noise influence from each corner, we take the dot product of each corner’s gradient vector with its corresponding distance vector. This yields the following:
These values
represent the noise contributions from each of the four cell corners, weighted by the direction and distance within the cell. To smoothly interpolate between the values
, we apply a fade function
which is based on the original work of Ken Perlin [
53]. This specific polynomial is a smooth step function ensuring smooth transitions with zero derivatives at both endpoints
and
, providing continuity. With
and
, we interpolate along the
x-axis first and then along the
y-axis:
The final result,
, is a continuous value that smoothly varies across the grid, creating a natural-looking gradient noise. Lastly, the noise values are then normalized using min–max normalization
scaling them to a target range, typically
. This model can be utilized to generate a random map that covers network communication, terrain-based safety features, or other comparable aspects. These maps can either be set as fixed, remaining unchanged once generated, or they can be updated with each new instance. Additionally, a threshold can be used to model forbidden zones, which results in a binary map. Two of these maps are represented in
Figure 6.
This gradient noise can not model strong winds or heavy rain. If this is present in a particular location, it may be advisable to position the mobile micro hub elsewhere to enhance operational efficiency. Adverse weather can significantly affect a drone’s battery consumption and overall performance, as drones are not equipped to handle all weather conditions effectively.
4.4.2. Gradient Fields for Weather Conditions
To generate random gradient vectors that model wind behavior, we begin by establishing a two-dimensional grid in the Cartesian coordinate system, which serves as the domain for evaluating wind vectors at various points. We define the grid by specifying
x and
y coordinates within the interval
. Specifically, we let
denote the number of grid points along each axis, such that
x and
y can be represented as follows:
The mesh grid function is then employed to create matrices
X and
Y, representing all combinations of these coordinate values. Given two vectors,
and
defined as
and
; then, the meshgrid creates two matrices
X and
Y based on these vectors, where
X contains repeated rows of
and
Y contains repeated columns of
. Mathematically, this can be expressed as
and
for
.
Next, we introduce sources and sinks to simulate the generation and absorption of wind. Let
and
be hyperparameters representing the number of sources and sinks, respectively, sampled from a uniform integer distribution
and
. The positions of these sources and sinks are generated by sampling from a uniform continuous distribution over the interval
. Specifically, the coordinates for sources and sinks are represented as follows:
To simulate wind behavior, we initialize the wind vector components
U and
V to zero across the grid. The influence of each source on the wind vectors is calculated by considering the Euclidean distance from the source to each point in the grid. For a source located at
, the components of the wind vector are updated as follows:
In contrast, the influence of each sink on the wind vectors is computed by creating vectors that point toward the sink. For a sink located at
, the updates to the wind vector components are made as follows:
To introduce natural variability that reflects the stochastic nature of wind, we apply small random perturbations to both vector components
U and
V. This can be conceptualized as drawing from a uniform distribution centered at zero, denoted as
and
, where
is a hyperparameter that controls the magnitude of the perturbations. Thus, the final wind vector components can be expressed as follows:
The outcome is a two-dimensional vector field where the arrows represent the wind direction and magnitude, demonstrating how the presence of randomly placed sources and sinks influences the flow dynamics across the specified area. This methodology represented in
Figure 7 effectively integrates mathematical principles with random sampling techniques to create a realistic model of wind behavior, capturing the complexities inherent in natural systems.
By integrating these environmental factors into our logistical modeling, we model better real-world conditions and improve the overall synthetic multimodal logistics network.
4.5. Generation of Nodes
Here, we consider three kinds of nodes: warehouses, micro hubs, and customers, where the micro hubs share the same properties as the logistic centers. The warehouse plays a crucial role in this system, functioning as a central hub where various elements of the logistics network converge. After the goods are efficiently processed and dispatched from the warehouse, the next critical step is their transport to the micro hubs. There, the mode of delivery shifts from truck to drone, allowing the final leg of the journey to the customer to proceed, ensuring timely and precise delivery. In a logistics distribution scenario, customers play the next central role in shaping the logistics strategy, influencing demand patterns, and impacting overall supply chain efficiency. Therefore, we uniformly distribute the amount of the different nodes to cover the complete domain effectively and create diverse scenarios:
for customers,
for warehouses,
for micro hubs. Each created node obtains its ID
,
,
as described in
Table 1.
The precise positioning of the warehouses, customers, and micro hubs within the network becomes essential for optimizing the delivery routes and overall system. The positions
of the nodes are randomly distributed on a two-dimensional surface
. The coordinates of a node
are described by independent random variables
, each of which follows a continuous uniform distribution over
since every point in the surface has the same probability of being the position of a node:
Here,
a and
b span the surface
and describe geographic coordinates which are also calculated to Cartesian coordinates based on the rectangular projection that maps the spherical coordinates onto a flat surface. This method assumes the Earth is a perfect sphere, which is an approximation.
The number of nodes and the position of the nodes were all the shared parameters. From this point forward, we will examine each type of node individually.
4.5.1. Customers Parameter Sampling
Considering the diverse characteristics of customers enhances service delivery and ultimately achieves customer satisfaction. The demand of a customer
k for the transportation of goods is modeled by a discrete uniform distribution:
Here, demand encompasses both the number of orders and the specific attributes of each order
b, including importance, sensitivity, weight, and timestamp. The importance
and sensitivity
are modeled as discrete values uniformly distributed over the range
. The weight
W, in contrast, follows a continuous uniform distribution within
, where
denotes the maximum allowable weight. The timestamp is provided in ISO 8601 format [
52], an unambiguous, internationally recognized calendar-and-clock format. Every order has also a status parameter which is initially set to 0, which means that this order is open. Lastly, every order is also assigned to a random warehouse or micro hub, where the order is initially stored.
Our scenarios are designed to address both Business-to-Consumer (B2C) and Business-to-Business (B2B) deliveries, with a particular focus on B2C applications, which are most commonly associated with drone-based logistics. B2C and B2B logistics differ significantly in terms of complexity and focus. B2C logistics involves a longer supply chain with multiple intermediaries and focuses on delivering products to individual consumers, prioritizing fast delivery and flexible shipping with many dense nodes. B2B logistics, on the other hand, often involves a shorter and more direct supply chain with fewer intermediaries, focusing on delivering goods to businesses with specific requirements for timing. B2B orders tend to be larger in volume but less in the amount and density of nodes. However, the generator is flexible and can also be adapted for B2B use cases, by specifying the amount of customers and their demand [
54].
4.5.2. Warehouse Parameter Sampling
To model the any logistics center
i, one key property, the storage capacity
, needs to be defined:
where
is the average storage capacity of a warehouse and
defines the variance in storage capacity due to factors such as design variability, seasonal demand fluctuations, or operational constraints.
Maintaining this capacity incurs costs. The primary objective of the warehouse is to efficiently manage and store inventory to meet demand while minimizing overall expenses. To account for these costs, we allocate both variable and fixed costs associated with the warehouse according to a log-normal distribution. A log-normal distribution is suitable for costs that can vary widely but are generally positive (e.g., for warehouses with periodic capital investments). This can reflect the skew toward higher costs, as certain maintenance or repairs can add substantially to fixed expenses. This sampling is defined as follows:
where
and
represent the means, and
and
represent the variances of variable costs and the fix costs.
Given the increasing urgency around climate change and global warming, businesses are placing greater emphasis on understanding the environmental impact of their activities and processes. To address this, we also model the ecological costs associated with warehouse operations using log-normal distribution. The log-normal distribution is ideal for modeling ecological costs as it captures positive-only, skewed data with typical low-to-moderate values but allows for rare, high-cost events. This reflects the nature of environmental impacts in warehouses, where occasional spikes occur due to unexpected high-energy or resource-intensive activities. The distributions are described as follows:
where
,
and
,
represents the means and variances of the impact of the ecologic cost.
4.5.3. Micro Hub Parameter Sampling
In our framework, mobile centers play a critical role in the delivery process, particularly in the context of drone-based last-mile logistics. These mobile centers, or micro hubs, function as storage units that share the same characteristics as a traditional warehouse. They are designed to store and organize goods efficiently, acting as temporary distribution points that facilitate faster deliveries to the final customer. Mobile centers are especially useful in urban areas where they can be strategically placed to optimize delivery routes, reduce travel times, and alleviate congestion. By incorporating these mobile centers into our logistics model, we enhance the flexibility and scalability of the delivery system, ensuring that goods can be stored closer to targets, thus improving overall efficiency in the last-mile delivery process.
The micro hub functions mainly as a storage unit and shares the same characteristics as a traditional warehouse, such as the ability to store and organize goods efficiently. In addition to its core characteristics, the micro hub has an extra parameter representing the number of drone slots available. This enables the micro hub to transport drones to specific locations, where the last-mile delivery is achieved. The number of drone slots is sampled from a uniform distribution . Additionally, each micro hub has a hub-type parameter , which specifies whether it serves a single vehicle type, such as trucks (in which case drone slots are set to zero), or accommodates multiple vehicle types. Furthermore, to the warehouse parameters, it possesses the attributes of a vehicle, specifically a truck. The status parameter of the micro-hub indicates whether the hub is currently functioning as a mobile vehicle, actively changing position, or as a stationary micro-warehouse. This status also determines the applicable cost model: a time-based model for stationary warehouse operations or a distance-based model for vehicle movement. The specific vehicle-related properties and capabilities of the micro hub are detailed in the following section.
4.6. Generation of Vehicles
Vehicles in a logistics distribution network are diverse, each serving specific roles and functions based on the requirements of the logistics operation. By strategically managing the fleet of vehicles and their properties, we can improve operational performance and drive cost efficiencies within the supply chain. The number of vehicles of a type
is also generated randomly similarly as the nodes with a discrete uniform distribution
. Then, for each vehicle
u, a start position
is randomly assigned to a warehouse
i. The capacities of the vehicles
, their speed
, and the weight
are modeled by normal distributions:
In this context, the capacity, speed, and weight parameters vary based on the vehicle type
m, which specifies whether the vehicle is a drone, truck, or another transport mode. For example, drones typically have lower carrying capacities and weights but higher agility and speed over short distances, while trucks offer greater capacity and weight limits, suitable for longer hauls and larger payloads. Each vehicle type’s unique specifications ensure optimized performance depending on the intended delivery range and payload size.
To determine the maximum range of a vehicle, we start by sampling its battery or fuel level from a continuous uniform distribution , representing a minimum of 20% capacity. The energy consumption rate is then sampled from a normal distribution, , where the mean and variance depend on the vehicle type (e.g., drone or truck). The maximum range is calculated as , linking available energy and consumption rate to the distance the vehicle can travel.
The cost structure mirrors that of the warehouse outlined in
Section 4.5.2, with the key difference that here costs are based on route distances, while warehouse costs are time-dependent. Consequently, the total duration and route distances are calculated according to the approach described in the following section, which covers the generation of edges.
4.7. Generating the Edges
For each possible pair of nodes we decide whether an edge exists. This is modeled by real-time traffic for transit between two nodes. Therefore, we take the locations of one pair as input and construct a query to the Open Source Routing Machine OSRM API to obtain routing information specifically for car travel. The API response provides key metrics, including the distance in kilometers and the estimated travel time in minutes. Additionally, if detailed route information is requested, the method retrieves specific nodes along the route to gather further geographic details. These nodes are then cross-referenced with the OpenStreetMap database to obtain their precise latitude and longitude coordinates. The final output consists of the total travel distance, the estimated duration, and, if applicable, a list of the geographic coordinates of the route nodes. Alternatively, for drones, the route depends on the Euclidean distance . Each edge is provided with additional attributes that represent, for example, the environmental and economic costs , the travel duration and the safety rating .
5. Evaluation
In this section, we will evaluate the generated dataset to illustrate its characteristics. To evaluate the structure and quality of the synthetic dataset, we focus on a specific instance that models a logistics distribution network. Finally, we will demonstrate the practical application of our dataset by applying it to a classical machine learning and path planning application, showcasing its utility and effectiveness in a real-world context.
5.1. Dataset Instances
To demonstrate the quality and structure of our synthetic dataset [
9], we analyze a representative sample graph generated by our model in
Figure 8 and
Figure 9. This instance contains key properties that reflect the overall characteristics of the samples. In this instance, we consider three key entities: customers, micro hubs, and logistic centers, with a background gradient map representing the relative quality of the connectivity and the wind properties. Darker regions in the gradient indicate lower-quality connections (high latency or poor infrastructure), while lighter regions represent areas with high-quality connections.
Nodes are positioned randomly within predefined regions, each specified by minimum and maximum latitude and longitude boundaries. For each customer, we use the Euclidean distance for each drone, focusing on simplified, straight-line paths that approximate direct air travel. In this scenario, network communication is assumed to be used only for optimizing the placement of micro hubs, allowing efficient coverage across all customer locations, and not for drone routing. In contrast, routes from the main deposits to the micro hubs are managed by trucks, which require real-time route calculations. These truck routes are calculated through the OpenStreetMap (OSM) API, as discussed in
Section 4.7, and reflect realistic driving paths. By leveraging OSM’s real-time routing, we capture the influence of road networks, traffic conditions, and other logistical constraints that impact ground transport. This approach of direct drone routes and road-based truck routes allows us to simulate the dynamics of multimodal delivery networks, highlighting the interactions between drones’ air travel capabilities and trucks’ dependence on road infrastructure.
In more realistic scenarios, we can enhance the precision of location-based optimizations by manually defining distinct regions, such as rural areas, urban areas, and zones near highways, which are critical for setting customer locations and determining warehouse placements. Furthermore, given the use of real-time routing, it is also possible to sample data at various times of the day—for instance, morning hours when traffic density is typically higher, and nighttime hours when traffic is generally lighter. This temporal sampling enables to account for daily traffic variations, thereby enhancing the robustness and realism of the optimization process.
5.2. Facility Localization
In this section, we present a proof of concept of our synthetic dataset. This proof of concept is not a fully optimized solution, as we have not performed a comparative analysis. We simply demonstrate the applicability of this dataset generator to reinforcement learning tasks. Therefore, we formalize the reinforcement learning setup of [
55] for optimizing micro hub locations using a custom environment that incorporates spatial and network considerations. The problem is framed as a Markov decision process
, where
is the continuous state space, representing the environment’s features, such as customer locations, network quality, and distances to warehouses. Each state
at time
t is a
d-dimensional vector:
, where
d is the number of features. We define the continuous action space as
, where each action
represents the latitude and longitude of a proposed micro hub location and is constrained by
to increase the accuracy of positioning and avoid unnecessary calculations in areas outside the relevant area. This reduction makes it possible to reduce the search area and capture the relevant positions more precisely, as the already known area is narrowed down and analyzed in a focused manner. This increases the efficiency of the positioning process by only carrying out an analysis within the restricted area. The transition dynamics
are deterministic, representing the environment’s response to the proposed micro hub location. The reward function
is defined as the negative of the total cost, where
with
d being the distance from truck to hub or drone to customer
i,
the variable cost per km of the truck or drone, and
representing the base costs of the vehicles. The reward encourages minimizing total distance-based costs and network penalties.
Here, the goal is to learn both an optimal policy
, parameterized by
, and a value function
, parameterized by
, which estimates the expected future rewards from state
. The policy network (actor) selects actions, and the value network (critic) evaluates these actions. The actor’s objective is to maximize the expected discounted return:
where
is the discount factor. In the actor–critic method, we use the advantage function
to update the policy:
This advantage function measures how much better or worse the action
performed compared to the baseline value
provided by the critic.
The critic’s goal is to approximate the expected future rewards through the value function
. The critic is updated by minimizing the temporal difference error:
This loss encourages the critic to accurately predict future rewards, improving the stability of the actor’s updates. The training procedure is represented in
Figure 10.
By separating the actor and the critic, the actor–critic method reduces the variance in policy gradient estimates, providing a more stable and efficient learning process. The critic serves to provide a better estimate of the future reward, helping the actor to adjust its policy more effectively. The optimization results are represented in
Figure 11.
Figure 11 shows the reward or negative cost per episode, reflecting the agent’s performance in minimizing the total cost associated with micro hub location optimization. As training progresses, the rewards increase, indicating improved policy decisions by the actor in selecting optimal micro hub positions. Fluctuations in the reward early in training represent exploration, while the steady rise in later episodes demonstrates the convergence of the policy as the agent learns to reduce logistics and network-related costs. Therefore, the application of the synthetic dataset to the micro hub location optimization task demonstrates the dataset’s utility in training machine learning models for logistics network design.
5.3. Vehicle Routing Problem
After determining the location of the micro hub, the subsequent journey to the customers is assumed to be carried out using drones. Drones typically follow direct routes, transforming the problem into a Vehicle Routing Problem (VRP). The VRP is a key optimization challenge in logistics and transportation, focusing on determining the most efficient routes for servicing a given number of customers. The objective is to minimize a cost function, such as total distance traveled or time taken, while adhering to constraints like vehicle capacities, time windows, and route restrictions. In this case, the focus is on minimizing distance.
The Vehicle Routing Problem can be mathematically modeled as a graph , where represents the set of nodes, including the depot () and n customer locations. The edges denote possible routes between nodes, each associated with a cost , typically representing the Euclidean distance or travel time. The objective is to minimize the total cost of all routes:
where
is a binary decision variable indicating whether the route includes travel from node
i to node
j. The formulation is subject to the following constraints:
- 1.
Flow Conservation: Each customer is visited exactly once:
- 2.
Sub-tour Elimination: Disconnected sub-tours are prohibited, ensuring a single cohesive route:
- 3.
Capacity Constraint: The total demand served by any route cannot exceed the vehicle’s capacity Q:
where
represents the demand at node
i.
To solve the VRP, we begin by calculating the pairwise distances between nodes using the Euclidean distance formula, forming a distance matrix. Routes are represented as sequences of node indices, and the total distance of a given route is computed by summing the distances between consecutive nodes, including the return to the starting point. This implementation serves as a basic framework for solving the VRP where four examples are represented in
Figure 12.
This example demonstrates a classical Vehicle Routing Problem optimization. By incorporating additional features from our synthetic dataset, we can account for factors like network quality and wind conditions. As a result, the routes between customers are no longer direct lines, since they must be adaptable to environmental conditions. Therefore, we introduce a path-planning approach that considers these factors in the following section.
5.4. Path Planning
Pathfinding for drones in dynamic environments often involves not only geometric constraints but also environmental factors that influence the traversal cost. The A* algorithm is a graph traversal and path search algorithm. It combines the advantages of Dijkstra’s algorithm (minimizing the path cost) and Greedy Best-First Search (using a heuristic to guide the search). The total cost function at any node n is defined as , where is the cost from the start node to n and is a heuristic estimate of the cost from n to the goal. To incorporate dynamic costs based on external factors, specifically internet connectivity and wind conditions, we extend the A* algorithm. These factors influence the movement cost at each grid cell, which the algorithm integrates to find an optimal path. Additionally, we extend the A* algorithm to support diagonal movements, which have a cost of . The heuristic is updated to use the Euclidean distance .
In our approach, the cost function includes additional terms reflecting internet connectivity and wind conditions where is the accumulated cost to reach n, depends on whether the movement is diagonal () or straight (1), and . The internet connectivity cost penalizes regions with poor signal strength. The value is derived from a predefined grid or real-time signal strength map. High signal strength corresponds to lower costs. The wind cost accounts for wind speed and direction. Movements against strong winds incur higher penalties, while movements with the wind may receive a cost reduction.
We applied the algorithm on the 2D grid of
Figure 12 of size
, with internet connectivity and wind cost maps generated using our synthetic data.
Figure 13 illustrates the new paths calculated by the algorithm under varying conditions.
Figure 13 demonstrates that the algorithm adapts to environmental factors, choosing paths that minimize total cost even when such paths are geometrically longer. Consequently, the VRP of
Figure 12 has no longer direct paths and is modified. Therefore, we demonstrate that our synthetic dataset can also be used for path planning for drones considering different environmental scenarios which may lead to a change in the VRP optimization. Consequently, these two challenges must be addressed simultaneously in real-world scenarios, which can be effectively achieved using our data generator. By leveraging this tool, it becomes possible to simulate diverse and realistic conditions, enabling comprehensive testing and development of solutions that are better suited for practical applications.
6. Conclusions
This paper presents a novel synthetic dataset generator tailored for evaluating logistical optimization scenarios. The generator creates diverse and realistic datasets that simulate various customer distributions, network qualities, and logistical constraints, providing a robust foundation for testing and developing optimization algorithms. By offering detailed control over dataset parameters and incorporating real-world complexity, the synthetic dataset enhances the evaluation of optimization strategies. The demonstrated utility of this dataset in reinforcement learning experiments underscores its potential to advance research in logistics and operations management. Future work may extend this framework to other domains and refine the dataset generator to further align with emerging challenges in logistics optimization, such as job shop scheduling. Finally, the objective of these datasets is to establish a standardized benchmark. The concept is that researchers can leverage the datasets tailored to specific optimization problems detailed in
Section 3 to evaluate and compare the performance of their algorithms. We will release datasets for each of the optimization problems in
https://doi.org/10.5281/zenodo.14037047 (accessed on 22 November 2024) in a phased manner, ensuring a steady supply for algorithm performance comparison.
6.1. Recommendation for Future Users
We recommend that users utilize this generator to explore various optimization strategies, particularly in areas such as drone-based delivery, route planning, and supply chain management. When using the dataset generator, it is important to consider several factors to ensure the efficient generation of realistic datasets within a reasonable timeframe. One of the key elements affecting the speed of dataset generation is the number of customers. Since the generator calculates all possible delivery routes through an API, having too many customers can significantly slow down the process. To optimize performance, it is recommended to activate routes via the API only when necessary. If generating routes for all customers is essential, reducing the overall number of customers will help ensure that a larger number of samples can be generated more quickly, without compromising the variety needed for testing optimization algorithms.
Another important factor is the size of the geographic area used for deliveries. Larger maps, with expansive longitude and latitude ranges, lead to longer routes, which in turn increases computation time. To improve efficiency, it is advisable to keep the map size within realistic limits, ensuring that the minimum and maximum longitude and latitude values are not excessively large. This will help avoid overly long delivery routes, thus reducing generation time. Additionally, it is essential to maintain a realistic scope for the deliveries. This means ensuring that the distances between customers are appropriate for the type of delivery being simulated and all places should be accessible with any type of vehicle.
6.2. Future Research
This work provides a foundation for analyzing various optimization problems in future research. In future work, we aim to leverage this synthetic data generator to conduct a comprehensive analysis of various optimization problems within multimodal delivery systems. By conducting controlled experiments, we will compare the performance of different algorithms and identify the most promising approaches for real-world applications. This research will enable us to propose novel algorithms and strategies to enhance the efficiency and sustainability of multimodal delivery operations. Looking ahead, ongoing improvement in the dataset generator could further enhance its applicability, especially in addressing emerging challenges in logistics optimization. Specifically, the integration of dynamic and uncertain factors, such as unexpected disruptions in the supply chain. By including these uncertainties, the generator can create more realistic and complex datasets that challenge optimization algorithms to better account for real-world variability. Through these developments, the generator will continue to serve as a valuable tool for testing, comparing, and improving logistical optimization algorithms.