DE102023102770A1

DE102023102770A1 - METHOD AND DEVICE FOR TRAINING A MODEL BASED ON ARTIFICIAL INTELLIGENCE

Info

Publication number: DE102023102770A1
Application number: DE102023102770.0A
Authority: DE
Inventors: Sebastian Schmidt
Original assignee: Bayerische Motoren Werke AG
Current assignee: Bayerische Motoren Werke AG
Priority date: 2023-02-06
Filing date: 2023-02-06
Publication date: 2024-08-08

Abstract

Bereitgestellt wird ein Computer-implementiertes Verfahren zum Trainieren eines auf künstlicher Intelligenz beruhenden Modells (KI-Modell) zum Erstellen eines Umfeldmodells für ein automatisiertes Kraftfahrzeug. Das Verfahren umfasst ein Bereitstellen eines Trainingsdatensatzes umfassend gelabelte und ungelabelte Sensordaten von Umfeldsensoren des Kraftfahrzeugs, und ein Trainieren des KI-Modells mit den gelabelten Sensordaten. Das Verfahren umfasst ein Trainieren eines Autoencoders mit den gelabelten Sensordaten, ein Erzeugen eines Umfeldmodells basierend auf den ungelabelten Sensordaten mittels des trainierten KI-Modells, ein Erzeugen von Gitternetz-Repräsentationen des basierend auf den ungelabelten Sensordaten mittels des trainierten KI-Modells erzeugten Umfeldmodells, ein Erzeugen von komprimierten Repräsentationen der Gitternetz-Repräsentationen mittels des trainierten Autoencoders, und ein Durchführen einer Kernsatzauswahl basierend auf den komprimierten Repräsentationen, um so zumindest ein Set von Sensordaten aus den ungelabelten Sensordaten für ein weiteres Trainieren des KI-Modells auszuwählen. A computer-implemented method is provided for training a model based on artificial intelligence (AI model) for creating an environment model for an automated motor vehicle. The method comprises providing a training data set comprising labeled and unlabeled sensor data from environment sensors of the motor vehicle, and training the AI model with the labeled sensor data. The method comprises training an autoencoder with the labeled sensor data, generating an environment model based on the unlabeled sensor data using the trained AI model, generating grid representations of the environment model generated based on the unlabeled sensor data using the trained AI model, generating compressed representations of the grid representations using the trained autoencoder, and performing a core set selection based on the compressed representations in order to select at least one set of sensor data from the unlabeled sensor data for further training of the AI model.

Description

Die vorliegende Offenbarung betrifft ein computer-implementiertes Verfahren zum Trainieren eines auf künstlicher Intelligenz beruhenden Modells (KI-Modells) und/oder eine Vorrichtung zur Datenverarbeitung bzw. eine Datenverarbeitungsvorrichtung, die ausgestaltet ist, um das Verfahren auszuführen. Zusätzlich, oder alternativ, wird ein Computerprogramm bereitgestellt, das Befehle umfasst, die bei der Ausführung des Programms durch einen Computer diesen dazu veranlassen, das Verfahren zumindest teilweise auszuführen. Zusätzlich, oder alternativ, wird ein computerlesbares Medium bereitgestellt, das Befehle umfasst, die bei der Ausführung der Befehle durch einen Computer diesen dazu veranlassen, das Verfahren zumindest teilweise auszuführen. Die vorliegende Offenbarung betrifft, zusätzlich oder alternativ, eine Verwendung eines mit dem Verfahren trainierten KI-Modells.The present disclosure relates to a computer-implemented method for training a model based on artificial intelligence (AI model) and/or a device for data processing or a data processing device that is designed to carry out the method. Additionally or alternatively, a computer program is provided that includes instructions that, when the program is executed by a computer, cause the computer to at least partially carry out the method. Additionally or alternatively, a computer-readable medium is provided that includes instructions that, when the instructions are executed by a computer, cause the computer to at least partially carry out the method. The present disclosure relates, additionally or alternatively, to a use of an AI model trained with the method.

In modernen Kraftfahrzeugen, insbesondere Automobilen, werden vermehrt Fahrerassistenzsysteme verbaut.Driver assistance systems are increasingly being installed in modern motor vehicles, especially automobiles.

Fahrerassistenzsysteme (FAS; engl. (Advanced) Driver Assistance Systems, (A)DAS) sind elektronische, insbesondere mechatronische, Einrichtungen in Kraftfahrzeugen zur Unterstützung des Fahrers in bestimmten Fahrsituationen. Hierbei stehen oft Sicherheitsaspekte, aber auch die Steigerung des Fahrkomforts im Vordergrund.Driver assistance systems (FAS; (Advanced) Driver Assistance Systems, (A)DAS) are electronic, particularly mechatronic, devices in motor vehicles to support the driver in certain driving situations. The focus here is often on safety aspects, but also on increasing driving comfort.

Solche Fahrerassistenzsystem greifen, beispielsweise zur Planung einer Trajektorie des automatisierten Kraftfahrzeugs, auf ein Umfeldmodell zurück, welches ein Modell eines Umfeld des Kraftfahrzeugs aufweisend statische und dynamische Objekte, die sich in einem Umfeld des Kraftfahrzeugs befinden, umfasst.Such driver assistance systems rely, for example, on an environment model to plan a trajectory of the automated motor vehicle, which includes a model of an environment of the motor vehicle comprising static and dynamic objects that are located in an environment of the motor vehicle.

Zur Erstellung eines solchen Umfeldmodells können auf künstlicher Intelligenz beruhende Objekterkennungsalgorithmen, vorliegend allgemein als KI-Modell bezeichnet, genutzt werden, welche im trainierten Zustand ausgestaltet sind, um basierend auf Sensordaten einer Umfeldsensorik des Kraftfahrzeugs, z.B. umfassend einen Radarsensor, eine Kamera, einen LiDar-Sensor und/oder einen Infrarotsensor, das Umfeldmodell zu erzeugen.To create such an environment model, object recognition algorithms based on artificial intelligence, generally referred to here as AI models, can be used, which in the trained state are designed to generate the environment model based on sensor data from an environment sensor system of the motor vehicle, e.g. comprising a radar sensor, a camera, a LiDar sensor and/or an infrared sensor.

Ein Trainieren eines solche KI-Modells erfolgt zumeist im Rahmen eines überwachten Lernens. Dabei lernt das KI-Modell eine Funktion aus gegebenen Paaren von Ein- und Ausgaben, den sog. gelabelten Trainingsdaten, bei welchen es sich vorliegend um Sensordaten handelt. Ziel beim überwachten Lernen ist, dass dem KI-Modell nach mehreren Durchgängen mit unterschiedlichen Ein- und Ausgaben die Fähigkeit antrainiert wird, Assoziationen herzustellen.Training such an AI model is usually done as part of supervised learning. The AI model learns a function from given pairs of inputs and outputs, the so-called labeled training data, which in this case is sensor data. The aim of supervised learning is that the AI model is trained to make associations after several runs with different inputs and outputs.

Das Trainieren solch eines KI-Modells erfordert jedoch ein zeit- und arbeitsaufwändiges Labeln der für das Training verwendeten Sensordaten, insbesondere wenn es sich dabei um mehrdimensionale Daten (wie z.B. bei einem LiDAR-Sensor der Fall) handelt, sodass unter anderem deswegen die Bestrebung besteht, eine Anzahl an für das Trainieren nötigen gelabelten Sensordaten so gering wie möglich zu halten.However, training such an AI model requires time-consuming and labor-intensive labeling of the sensor data used for training, especially when it is multidimensional data (as is the case with a LiDAR sensor, for example), so one of the reasons for this is to keep the amount of labeled sensor data required for training as low as possible.

Ein Ansatz ist dabei das sog. aktive Lernen, welches ein Teilgebiet des iterativen überwachten Lernens darstellt. Das aktive Lernen ist also ein Spezialfall des maschinellen Lernens, bei dem ein Lernalgorithmus interaktiv eine Informationsquelle abfragen kann, um bis dato ungelabelte Trainingsdaten für eine weiteres Trainieren gelabelt zur Verfügung gestellt zu bekommen. In der Statistikliteratur wird es manchmal auch als optimale Versuchsplanung bezeichnet. Die Informationsquelle wird auch als Lehrer oder Orakel bezeichnet.One approach is so-called active learning, which is a sub-area of iterative supervised learning. Active learning is therefore a special case of machine learning in which a learning algorithm can interactively query an information source in order to obtain previously unlabeled training data in a labeled form for further training. In the statistics literature, it is sometimes referred to as optimal experimental design. The information source is also referred to as a teacher or oracle.

Da das KI-Modell die zu labelnden Sensordaten selbst auswählt, kann die Anzahl der Sensordaten, die zum Trainieren benötigt werden, oft viel geringer sein als beim herkömmlichen überwachten Lernen, bei dem alle zur Verfügung stehenden Sensordaten gelabelt werden.Because the AI model itself selects the sensor data to label, the amount of sensor data needed for training can often be much smaller than in traditional supervised learning, where all available sensor data is labeled.

Das aktive Lernen wurde jedoch bis dato hauptsächlich für Bildklassifizierungsaufgaben evaluiert und zeigte bei hochdimensionalen Daten eine unzureichende Leistung.However, active learning has so far been evaluated mainly for image classification tasks and showed insufficient performance on high-dimensional data.

Vor dem Hintergrund dieses Standes der Technik besteht die Aufgabe der vorliegenden Offenbarung darin, eine Vorrichtung und/oder ein Verfahren anzugeben, die jeweils geeignet sind, den Stand der Technik zu bereichern.Against the background of this prior art, the object of the present disclosure is to provide a device and/or a method which are each suitable for enriching the prior art.

Gelöst wird die Aufgabe durch die Merkmale der unabhängigen Ansprüche. Die nebengeordneten Ansprüche und die Unteransprüche haben jeweils optionale Weiterbildungen der Offenbarung zum Inhalt.The problem is solved by the features of the independent claims. The subordinate claims and the dependent claims each contain optional further developments of the disclosure.

Danach wird die Aufgabe durch ein computer-implementiertes Verfahren zum Trainieren eines auf künstlicher Intelligenz beruhenden Modells (KI-Modell) zum Erstellen eines Umfeldmodells für ein automatisiertes Kraftfahrzeug gelöst.The task is then solved by a computer-implemented method for training an artificial intelligence-based model (AI model) to create an environment model for an automated motor vehicle.

Das Verfahren umfasst ein Bereitstellen eines Trainingsdatensatzes umfassend gelabelte und ungelabelte Sensordaten von Umfeldsensoren des Kraftfahrzeugs, und ein Trainieren des KI-Modells mit den gelabelten Sensordaten.The method comprises providing a training data set comprising labeled and unlabeled sensor data from environmental sensors of the motor vehicle, and training the AI model with the labeled sensor data.

Das Verfahren umfasst ferner ein Trainieren eines Autoencoders mit den gelabelten Sensordaten, ein Erzeugen eines Umfeldmodells basierend auf den ungelabelten Sensordaten mittels des trainierten KI-Modells, ein Erzeugen von Gitternetz-Repräsentationen des basierend auf den ungelabelten Sensordaten mittels des trainierten KI-Modells erzeugten Umfeldmodells, ein Erzeugen von komprimierten Repräsentationen der Gitternetz-Repräsentationen mittels des trainierten Autoencoders, und ein Durchführen einer Kernsatzauswahl.The method further comprises training an autoencoder with the labeled sensor data, generating an environment model based on the unlabeled sensor data using the trained AI model, generating mesh representations of the environment model generated based on the unlabeled sensor data using the trained AI model, generating compressed representations of the mesh representations using the trained autoencoder, and performing a core set selection.

Die Kernsatzauswahl kann basierend auf den komprimierten Repräsentationen durchgeführt werden, um so zumindest ein Set von Sensordaten aus den ungelabelten Sensordaten zum Labeln für ein weiteres Trainieren des KI-Modells auszuwählen.Core set selection can be performed based on the compressed representations to select at least one set of sensor data from the unlabeled sensor data to label for further training of the AI model.

Unter einem computer-implementierten Verfahren kann ein Verfahren verstanden werden, bei dem zumindest ein Schritt des Verfahrens zumindest teilweise und/oder zeitweise von einem Computer bzw. einer Datenverarbeitungsvorrichtung ausgeführt wird.A computer-implemented method can be understood to mean a method in which at least one step of the method is carried out at least partially and/or temporarily by a computer or a data processing device.

Unter dem Begriff des Trainierens kann vorliegend maschinelles Lernen verstanden werden. Unter maschinellem Lernen kann eine künstliche Generierung von Wissen aus Erfahrung verstanden werden. Beim maschinellen Lernen lernt ein künstliches System, vorliegend als das auf künstlicher Intelligenz beruhende Modell bezeichnet, aus Beispielen und kann diese nach Beendigung der Lernphase verallgemeinern. Dazu bauen Algorithmen beim maschinellen Lernen ein statistisches Modell auf, das auf Trainingsdaten beruht und welches gegengetestet wird. Das heißt, es werden nicht einfach die Beispiele auswendig gelernt, sondern Muster und Gesetzmäßigkeiten in den Lerndaten erkannt. So kann das System auch unbekannte Daten beurteilen (sogenannter Lerntransfer). Vorliegend wird der eingangs beschriebene Ansatz des aktiven Lernens gewählt (Details s. oben).The term training can be understood here as machine learning. Machine learning can be understood as the artificial generation of knowledge from experience. In machine learning, an artificial system, here referred to as the model based on artificial intelligence, learns from examples and can generalize them after the learning phase has ended. To do this, algorithms in machine learning build a statistical model that is based on training data and which is counter-tested. This means that the examples are not simply learned by heart, but patterns and regularities are recognized in the learning data. This enables the system to evaluate unknown data (so-called learning transfer). In this case, the active learning approach described at the beginning is chosen (see above for details).

Der Autoencoder kann ein künstliches neuronales Netz umfassen oder daraus bestehen, das dazu genutzt wird, um effiziente Codierungen zu lernen. Das Ziel eines Autoencoders ist es, eine komprimierte Repräsentation (sog. Encoding) für einen Satz Daten zu lernen und somit auch wesentliche Merkmale zu extrahieren. Dadurch kann der Autoencoder zur Dimensionsreduktion genutzt werden.The autoencoder can include or consist of an artificial neural network that is used to learn efficient encodings. The goal of an autoencoder is to learn a compressed representation (so-called encoding) for a set of data and thus to extract essential features. This allows the autoencoder to be used for dimensionality reduction.

Der Autoencoder kann mehrere Schichten umfassen, insbesondere eine Eingabeschicht, einige im Vergleich zur Eingabeschicht signifikant kleinere Schichten, die das Encoding bilden, und eine Ausgabeschicht, in der jedes Neuron die gleiche Bedeutung hat wie das entsprechende in der Eingabeschicht. Intuitiv gesehen projiziert der Encoder beim Trainieren des Autoencoders die anfänglichen Daten (hier die Gitternetz-Repräsentationen) in einen Raum mit üblicherweise tieferer bzw. kleiner(er) Dimension, und der Decoder hat als Aufgabe, die ursprünglichen Daten wieder zu rekonstruieren. Das Ziel ist es nun, Encoder und Decoder so zu wählen, dass der Unterschied zwischen den ursprünglichen Daten und den rekonstruierten Daten möglichst gering ist. Die Ausgabeschicht wird nur für das Trainieren des Autoencoders genutzt, wohingegen die das Encoding bildende kleineren Schichten zum Komprimieren der Gitternetz-Repräsentationen der ungelabelten Sensordaten genutzt werden, welche wiederum für die Auswahl der zu labelnden Sensordaten mittels Kernsatzauswahl (engl. core-set selection) des genutzt werden.The autoencoder can comprise several layers, in particular an input layer, some layers that are significantly smaller than the input layer and form the encoding, and an output layer in which each neuron has the same meaning as the corresponding one in the input layer. Intuitively, when training the autoencoder, the encoder projects the initial data (here the grid representations) into a space with usually deeper or smaller dimensions, and the decoder has the task of reconstructing the original data. The goal is now to choose the encoder and decoder so that the difference between the original data and the reconstructed data is as small as possible. The output layer is only used for training the autoencoder, whereas the smaller layers that form the encoding are used to compress the grid representations of the unlabeled sensor data, which in turn are used to select the sensor data to be labeled using core-set selection.

Die Idee hinter der Kernsatzauswahl bzw. dem Kernsatzansatz (engl. core-set selection/approach) besteht darin, eine Teilmenge s von Datenpunkten aus einer Gesamtmenge S auszuwählen, sodass alle verbleibenden Elemente von S sich im Radius δ zum nächsten Element von s befinden, d. h. s ist eine δ-Abdeckung von S. Die Menge s soll bei einer vorgegeben Größe so ausgewählt werden das der Radius δ minimal ist. Wie im aktiven Lernen üblich, wird auch dieser Prozess iterativ durchgefühlt und die Teilmenge der einzelnen Selektionen sⁱ (wobei i der Index des Durchlaufs 1,2,3 ... ist) akkumuliert. $min_{s^{1} : | s^{1} \leq b |} δ_{s^{0} \cup s^{1}}$

The idea behind core-set selection or the core-set approach is to select a subset s of data points from a total set S such that all remaining elements of S are within radius δ of the next element of s, i.e. s is a δ-cover of S. The set s should be selected for a given size such that the radius δ is minimal. As is usual in active learning, this process is also carried out iteratively and the subset of the individual selections s ⁱ (where i is the index of the

run

1,2,3 ...) is accumulated.

min_{s^{1} : | s^{1} \leq b |} δ_{s^{0} \cup s^{1}}

Durch eine Formulierung des Problem als k-Center-Problem kann eine berechnungseffizente 2-OPT-Lösung mithilfe eines Greedy-Algorithmus (https://de.wikipedia.org/wiki/Greedy-Algorithmus) aufgestellt werden, da dieser äquivalent ist zu: $min_{s^{1} : | s^{1} | \leq b} max_{i} min_{j \in s^{1} \cup s^{0}} Δ (x_{i}, x_{j})$

By formulating the problem as a k-center problem, a computationally efficient 2-OPT solution can be established using a greedy algorithm (https://de.wikipedia.org/wiki/Greedy-Protokoll), since it is equivalent to:

min_{s^{1} : | s^{1} | \leq b} \underset{i}{Max} min_{j \in s^{1} \cup s^{0}} Δ (x_{i}, x_{j})

Das offenbarungsgemäße Verfahren bietet inter alia den Vorteil, dass das aktive Lernen auf niedrigdimensionalen Umgebungseinbettungen anstelle von mehrdimensionale Sensorrohdaten, insbesondere LiDAR-Rohdaten, durchgeführt wird, und dass der Ansatz unabhängig von unterschiedlichen Hardware-Konfigurationen ist und unabhängig von der eigentlichen Aufgabe des KI-Modells ist.The method according to the disclosure offers, inter alia, the advantage that active learning is performed on low-dimensional environment embeddings instead of multidimensional raw sensor data, in particular LiDAR raw data, and that the approach is independent of different hardware configurations and independent of the actual task of the AI model.

Nachfolgend werden mögliche Weiterbildungen des oben beschriebenen Verfahrens im Detail erläutert.Possible further developments of the procedure described above are explained in detail below.

Die Umfeldsensoren einen LiDAR-Sensor aufweisen und die gelabelte und ungelabelte Sensordaten zumindest teilweise von dem LiDAR-Sensor aufgenommen wurden.The environmental sensors have a LiDAR sensor and the labeled and unlabeled sensor data were at least partially recorded by the LiDAR sensor.

Das Erzeugen der Gitternetz-Repräsentationen kann ein Erzeugen einer zweidimensionalen Vogelperspektiven-Ansicht basierend auf dem Umfeldmodell, welches nach dessen Erzeugung durch das KI-Modell als dreidimensionales Modell vorliegt, und ein Ableiten der Gitternetz-Repräsentationen aus der erzeugten zweidimensionalen Vogelperspektiven-Ansicht umfassen.Generating the grid representations may include generating a two-dimensional bird's eye view based on the environment model, which is present as a three-dimensional model after it is generated by the AI model, and deriving the grid representations from the generated two-dimensional bird's eye view.

Mit anderen Worten, das mittels des KI-Modells erzeugte Umfeldmodell ist zunächst ein räumliches Modell des Umfelds des Kraftfahrzeugs, welches dann in ein zweidimensionales Modell überführt wird. Das zweidimensionale Modell stellt die Szene bzw. das Umfeld in einer Draufsicht (d.h. mit Blickrichtung aus einer Z-Richtung von oben) dar und wird auch als Bird's-Eye-View (BEV) bezeichnet. Über diese Draufsicht wird ein Gitter gelegt, welches beispielsweise aus gleich großen quadratische Zellen aufgebaut sein kann. Jeder Zelle wird ein bestimmter in dem Umfeldmodell an der Stelle der jeweiligen Zellen enthaltener Informationsgehalt zugeordnet, sodass eine Gitternetz-Repräsentation entsteht. Da sich das Umfeldmodell mit der Zeit verändert, können aus dem Umfeldmodell mehrere zeitlich aufeinander folgende Gitternetz-Repräsentationen abgeleitet werden.In other words, the environment model generated using the AI model is initially a spatial model of the environment of the motor vehicle, which is then converted into a two-dimensional model. The two-dimensional model represents the scene or the environment in a top view (i.e. looking from a Z direction from above) and is also known as a bird's eye view (BEV). A grid is placed over this top view, which can be made up of square cells of equal size, for example. Each cell is assigned a specific information content contained in the environment model at the location of the respective cells, so that a grid representation is created. Since the environment model changes over time, several chronologically successive grid representations can be derived from the environment model.

Das Ableiten der Gitternetz-Repräsentationen aus der erzeugten zweidimensionalen Vogelperspektiven-Ansicht kann je Gitternetz-Repräsentation ein Ableiten eines Belegungs-Gitternetzes, das Informationen über eine Belegung einzelner das Belegungs-Gitternetz bildender Rechtecke basierend auf der Vogelperspektiven-Ansicht, und ein Ableiten jeweils eines Geschwindigkeits-Gitternetzes und eines Beschleunigungs-Gitternetzes für ein X-Richtung und eine Y-Richtung basierend auf der Vogelperspektiven-Ansicht und des Belegungs-Gitternetzes umfassen.Deriving the grid representations from the generated two-dimensional bird's eye view may comprise, for each grid representation, deriving an occupancy grid that contains information about an occupancy of individual rectangles forming the occupancy grid based on the bird's eye view, and deriving a velocity grid and an acceleration grid for an X direction and a Y direction based on the bird's eye view and the occupancy grid.

Das heißt, das Belegungs-Gitternetz kann je Zelle eine Information darüber umfassen, ob sich an der jeweiligen Stelle in dem Umfeldmodell ein Objekt befindet oder nicht. Das Belegungs-Gitternetz umfasst daher die räumlichen Informationen aus dem Umfeldmodell. Die zugehörigen Geschwindigkeits-Gitternetze und die zugehörigen Beschleunigungs-Gitternetze ordnen diesen Objekten nun eine Geschwindigkeit und eine Beschleunigung in X und Y, d.h. in der horizontalen Ebene, zu. Die Geschwindigkeits-Gitternetze und die Beschleunigungs-Gitternetze umfassen folglich die dynamischen bzw. zeitlichen Informationen aus dem Umfeldmodell.This means that the occupancy grid can contain information for each cell about whether or not there is an object at the respective location in the environment model. The occupancy grid therefore contains the spatial information from the environment model. The associated speed grids and the associated acceleration grids now assign a speed and an acceleration in X and Y, i.e. in the horizontal plane, to these objects. The speed grids and the acceleration grids therefore contain the dynamic or temporal information from the environment model.

Der Autoencoder kann einen Raum-Zeit-Autoencoder umfassen, der ausgestaltet ist, um aus den Belegungs-Gitternetz räumliche Informationen zu extrahieren und aus dem Geschwindigkeits-Gitternetzen und dem Beschleunigungs-Gitternetzen zeitliche Informationen zu extrahieren.The autoencoder may comprise a space-time autoencoder configured to extract spatial information from the occupancy grids and to extract temporal information from the velocity grids and the acceleration grids.

Die Sensordaten können während einer Fahrt mit dem Kraftfahrzeug kontinuierlich aufgenommen worden sein.The sensor data may have been recorded continuously during a journey in the motor vehicle.

Ferner wird eine Datenverarbeitungsvorrichtung bereitgestellt, wobei die Datenverarbeitungsvorrichtung ausgestaltet ist, um das oben beschriebene Verfahren zumindest teilweise auszuführen.Furthermore, a data processing device is provided, wherein the data processing device is designed to at least partially carry out the method described above.

Bei der Datenverarbeitungsvorrichtung kann es sich um einen Computer handeln, welcher einen Prozessor und einen Speicher aufweist, wobei in dem Speicher ein Computerprogramm hinterlegt ist, welches Anweisungen bzw. Befehle umfasst, die den Computer veranlassen, das oben beschriebene Verfahren zumindest teilweise auszuführen, wenn der Prozessor das in dem Speicher hinterlegte Computerprogramm lädt und ausführt.The data processing device can be a computer having a processor and a memory, wherein a computer program is stored in the memory, which includes instructions or commands that cause the computer to at least partially carry out the method described above when the processor loads and executes the computer program stored in the memory.

Das oben mit Bezug zum Verfahren Beschriebene gilt analog auch für die Datenverarbeitungsvorrichtung und umgekehrt.What is described above with reference to the method also applies analogously to the data processing device and vice versa.

Ferner wird ein Computerprogramm, umfassend Befehle, die bei der Ausführung des Programms durch einen Computer diesen veranlassen, das oben beschriebene Verfahren zumindest teilweise aus- bzw. durchzuführen, bereitgestellt.Furthermore, a computer program is provided, comprising instructions which, when the program is executed by a computer, cause the computer to at least partially carry out or implement the method described above.

Das oben mit Bezug zum Verfahren und zur Datenverarbeitungsvorrichtung Beschriebene gilt analog auch für das Computerprogramm und umgekehrt.What is described above with reference to the method and the data processing device also applies analogously to the computer program and vice versa.

Ferner wird ein computerlesbares Medium, insbesondere computerlesbares Speichermedium, bereitgestellt. Das computerlesbare Medium umfasst Befehle, die bei der Ausführung der Befehle durch einen Computer diesen veranlassen, das oben beschriebene Verfahren zumindest teilweise auszuführen.Furthermore, a computer-readable medium, in particular a computer-readable storage medium, is provided. The computer-readable medium comprises instructions which, when executed by a computer, cause the computer to at least partially carry out the method described above.

Das heißt, es kann ein computerlesbares Medium bereitgestellt werden, das ein oben definiertes Computerprogramm umfasst.That is, a computer-readable medium may be provided which comprises a computer program as defined above.

Bei dem computerlesbaren Medium kann es sich um ein beliebiges digitales Datenspeichergerät handeln, wie z. B. einen USB-Stick, eine Festplatte, eine CD-ROM, eine SD-Karte oder eine SSD-Karte (bzw. eine SSD-Festplatte).The computer-readable medium can be any digital data storage device, such as a USB stick, a hard disk, a CD-ROM, an SD card or an SSD card (or an SSD hard disk).

Das Computerprogramm muss jedoch nicht zwingend auf einem solchen computerlesbaren Speichermedium gespeichert sein, um dem Computer zur Verfügung gestellt zu werden, sondern kann auch über das Internet oder anderweitig extern bezogen werden. Mit anderen Worten bei dem computerlesbaren Medium kann es sich auch um ein Datensignal handeln.However, the computer program does not necessarily have to be stored on such a computer-readable storage medium in order to be made available to the computer, but can also be accessed via the Internet or otherwise externally. In other words, the computer-readable medium can also be a data signal.

Das oben mit Bezug zum Verfahren, zur Datenverarbeitungsvorrichtung, zum Computerprogramm Beschriebene gilt analog auch für das computerlesbare Medium und umgekehrt.What is described above with reference to the method, the data processing device and the computer program also applies analogously to the computer-readable medium and vice versa.

Ferner betrifft die Offenbarung eine Verwendung eines mit dem oben beschriebenen Verfahren trainierten KI-Modells im Rahmen eines automatisierten Fahrens eines automatisierten KraftfahrzeugsFurthermore, the disclosure relates to a use of an AI model trained with the method described above in the context of automated driving of an automated motor vehicle

Das Kraftfahrzeug kann eine Steuereinheit bzw. Steuervorrichtung umfassen. Bei dem Kraftfahrzeug kann es sich um einen Personenkraftwagen, insbesondere ein Automobil, oder ein Nutzfahrzeug, wie z. B. einen Lastkraftwagen, handeln.The motor vehicle can comprise a control unit or control device. The motor vehicle can be a passenger car, in particular an automobile, or a commercial vehicle, such as a truck.

Das Kraftfahrzeug kann automatisiert sein. Das Kraftfahrzeug kann ausgestaltet sein, um mittels der Steuervorrichtung eine Längsführung und/oder eine Querführung bei einem automatisierten Fahren des Kraftfahrzeugs zumindest teilweise und/oder zumindest zeitweise zu übernehmen.The motor vehicle can be automated. The motor vehicle can be designed to at least partially and/or at least temporarily assume longitudinal guidance and/or transverse guidance by means of the control device during automated driving of the motor vehicle.

Die Steuervorrichtung kann dafür auf ein mit dem oben beschriebenen Verfahren trainiertes KI-Modell, insbesondere aus ein damit erzeugte Umfeldmodell, zurückgreifen.For this purpose, the control device can use an AI model trained using the method described above, in particular an environment model generated thereby.

Das automatisierte Fahren kann so erfolgen, dass die Fortbewegung des Kraftfahrzeugs (weitgehend) autonom erfolgt. Das automatisierte Fahren kann zumindest teilweise und/oder zeitweise durch die Steuervorrichtung gesteuert werden.Automated driving can be carried out in such a way that the movement of the motor vehicle is (largely) autonomous. Automated driving can be controlled at least partially and/or temporarily by the control device.

Denkbar ist, dass das Kraftfahrzeug durch ein Fahrassistenzsystem aktiv, z. B. durch eine Anpassung einer Ist-Lenkradposition, und optional passiv, z. B. durch eine Anzeige eines Abbiegehinweises, in die Querführung des Kraftfahrzeugs eingreift.It is conceivable that the motor vehicle intervenes in the lateral guidance of the motor vehicle through a driver assistance system actively, e.g. by adjusting an actual steering wheel position, and optionally passively, e.g. by displaying a turning instruction.

Das Kraftfahrzeug kann ein Kraftfahrzeug der Autonomiestufe 0 sein, d. h. der Fahrer übernimmt die dynamische Fahraufgabe, auch wenn unterstützende Systeme (z. B. ABS oder ESP) vorhanden sind.The motor vehicle can be a motor vehicle with autonomy level 0, i.e. the driver takes over the dynamic driving task, even if supporting systems (e.g. ABS or ESP) are present.

Das Kraftfahrzeug kann ein Kraftfahrzeug der Autonomiestufe 1 sein, d. h. bestimmte Fahrerassistenzsysteme aufweisen, die den Fahrer bei der Fahrzeugbedienung unterstützen, wie beispielsweise der Abstandsregeltempomat (ACC).The motor vehicle may be a motor vehicle of autonomy level 1, i.e. it may have certain driver assistance systems that support the driver in operating the vehicle, such as adaptive cruise control (ACC).

Das Kraftfahrzeug kann ein Kraftfahrzeug der Autonomiestufe 2 sein, d. h. so teilautomatisiert sein, dass Funktionen wie automatisches Einparken, Spurhalten bzw. Querführung, allgemeine Längsführung, Beschleunigen und/oder Abbremsen von Fahrerassistenzsystemen übernommen werden.The motor vehicle can be a motor vehicle of autonomy level 2, i.e. it can be so partially automated that functions such as automatic parking, lane keeping or lateral guidance, general longitudinal guidance, acceleration and/or braking are taken over by driver assistance systems.

Das Kraftfahrzeug kann ein Kraftfahrzeug der Autonomiestufe 3 sein, d. h. so bedingungsautomatisiert, dass der Fahrer das System Fahrzeug nicht durchgehend überwachen muss. Das Kraftfahrzeug führt selbstständig Funktionen wie das Auslösen des Blinkers, Spurwechsel und/oder Spurhalten durch. Der Fahrer kann sich anderen Beschäftigungen widmen, wird aber bei Bedarf innerhalb einer Vorwarnzeit vom System aufgefordert die Führung zu übernehmen.The motor vehicle can be a level 3 autonomy motor vehicle, i.e. so automated that the driver does not have to continuously monitor the vehicle system. The motor vehicle independently carries out functions such as activating the indicator, changing lanes and/or keeping in lane. The driver can attend to other activities, but will be prompted by the system to take over control within a pre-warning period if necessary.

Das Kraftfahrzeug kann ein Kraftfahrzeug der Autonomiestufe 4 sein, d. h. so hochautomatisiert sein, dass die Führung des Fahrzeugs dauerhaft vom System des Fahrzeugs übernommen wird. Werden die Fahraufgaben vom System nicht mehr bewältigt, kann der Fahrer aufgefordert werden, die Führung zu übernehmen.The motor vehicle can be a motor vehicle with autonomy level 4, i.e. it can be so highly automated that the vehicle's system permanently takes over control of the vehicle. If the system can no longer handle the driving tasks, the driver can be asked to take over control.

Das Kraftfahrzeug kann ein Kraftfahrzeug der Autonomiestufe 5 sein, d. h. so vollautomatisiert, dass der Fahrer zum Erfüllen der Fahraufgabe nicht erforderlich ist. Außer zu dem Festlegen des Ziels und zu dem Starten des Systems ist kein menschliches Eingreifen erforderlich. Das Kraftfahrzeug kann ohne Lenkrad und Pedale auskommen.The motor vehicle can be a level 5 autonomy motor vehicle, i.e. so fully automated that the driver is not required to perform the driving task. No human intervention is required other than setting the destination and starting the system. The motor vehicle can operate without a steering wheel and pedals.

Die Steuervorrichtung bzw. Steuereinheit kann Teil eines Fahrassistenzsystems sein oder dieses darstellen. Bei der Steuervorrichtung kann es sich beispielsweise um eine elektronische Steuereinheit (engl. ECU = electronic control unit) handeln. Die elektronische Steuereinheit kann eine intelligente prozessorgesteuerte Einheit sein, die z. B. über ein Central Gateway (CGW) mit anderen Modulen kommunizieren kann und die ggf. über Feldbusse, wie den CAN-Bus, LIN-Bus, MOST-Bus, FlexRay und/oder über das Automotive-Ethernet, z. B. zusammen mit Telematiksteuergeräten und/oder einer Umfeldsensorik, das Fahrzeugbordnetz bilden kann.The control device or control unit can be part of a driver assistance system or represent this. The control device can be, for example, an electronic control unit (ECU). The electronic control unit can be an intelligent processor-controlled unit that can communicate with other modules via a central gateway (CGW), for example, and that can form the vehicle's on-board network via field buses such as the CAN bus, LIN bus, MOST bus, FlexRay and/or the automotive Ethernet, e.g. together with telematics control units and/or environmental sensors.

Denkbar ist, dass die Steuervorrichtung für das Fahrverhalten des Kraftfahrzeugs relevante Funktionen, wie die Lenkung, die Motorsteuerung, die Kraftübertragung, und/oder das Bremssystem, steuert. Außerdem können Fahrerassistenzsysteme, wie beispielsweise ein Parkassistent, eine angepasste Geschwindigkeitsregelung (ACC, engl. Adaptive Cruise Control), ein Spurhalteassistent, ein Spurwechselassistent, eine Verkehrszeichenerkennung, eine Lichtsignalerkennung, ein Anfahrassistent, ein Nachtsichtassistent und/oder ein Kreuzungsassistent, von der Steuervorrichtung gesteuert werden.It is conceivable that the control device controls functions relevant to the driving behavior of the motor vehicle, such as the steering, engine control, power transmission and/or the braking system. In addition, driver assistance systems such as a parking assistant, adaptive cruise control (ACC), lane keeping assistant, lane change assistant, traffic sign recognition, light signal recognition, start-off assistant, night vision assistant and/or intersection assistant can be controlled by the control device.

Das oben mit Bezug zum Verfahren, zur Vorrichtung, zum Computerprogramm und zum computerlesbaren Medium Beschriebene gilt analog auch für die Verwendung und umgekehrt.What is described above with reference to the method, the apparatus, the computer program and the computer-readable medium also applies analogously to the use and vice versa.

Nachfolgend wird eine optionale Ausführungsform mit Bezug zu 1 beschrieben.

1 zeigt schematisch ein Ablaufdiagramm eines computer-implementierten Verfahrens zum Trainieren eines KI-Modells zum Erstellen eines Umfeldmodells für ein automatisiertes Kraftfahrzeug.

Below, an optional embodiment is described with reference to 1 described.

1 shows a schematic flow chart of a computer-implemented method for training an AI model to create an environment model for an automated motor vehicle.

In einem ersten Schritt S1 des Verfahrens erfolgt ein Bereitstellen eines Trainingsdatensatzes umfassend gelabelte Sensordaten 1 und ungelabelte Sensordaten 2, die von Umfeldsensoren eines Kraftfahrzeugs, vorliegend einem LiDAR-Sensor, während einer /Test-) Fahrt mit dem Kraftfahrzeug laufend bzw. kontinuierlich aufgenommen wurden.In a first step S1 of the method, a training data set is provided comprising labeled sensor data 1 and unlabeled sensor data 2, which were continuously recorded by environmental sensors of a motor vehicle, in this case a LiDAR sensor, during a (test) drive with the motor vehicle.

In einem zweiten Schritt S2 des Verfahren erfolgt ein (initiales) Trainieren des KI-Modells 3, das nach dem Trainieren zum Erstellen eines Umfeldmodells 5 basierend auf den Sensordaten geeignet ist, mit den gelabelten Sensordaten 1 und in einem dritten Schritt S3 des Verfahren erfolgt ein Trainieren eines Autoencoders 4. Dazu wird mittels des trainierten Modells 3 ein zunächst als 3D Modell vorliegendes Umfeldmodell 5 basierend auf den gelabelten Sensordaten 1 erstellt (S31), aus diesem basierend auf den gelabelten Sensordaten 1 erstellten Umfeldmodell 5 wird anschließend eine Vogelperspektiven-Ansicht 7 erstellt (S32), woraus wiederum Gitternetz-Repräsentationen 6 zum Trainieren des Autoencoders 4 abgeleitet werden (S33), die dann in den Autoencoder eingegeben werden (S34). Dieses Vorgehen ist anlog zu den Schritte S4 - S7 des Verfahrens, welche nachfolgend im Bezug auf die ungelabelten Sensordaten 2 im Detail beschrieben werden mit dem Unterschied, dass in dem siebten eine Ausgabe einer Ausgabeschicht des Autoencoders 4 nicht weiter verwendet wird, sondern die komprimierten Daten weiter verarbeitet werden.In a second step S2 of the method, the AI model 3, which after training is suitable for creating an environment model 5 based on the sensor data, is (initially) trained with the labeled sensor data 1, and in a third step S3 of the method, an autoencoder 4 is trained. For this purpose, an environment model 5, initially available as a 3D model, is created using the trained model 3 based on the labeled sensor data 1 (S31). A bird's eye view 7 is then created from this environment model 5 created based on the labeled sensor data 1 (S32), from which grid representations 6 are derived for training the autoencoder 4 (S33), which are then input into the autoencoder (S34). This procedure is analogous to steps S4 - S7 of the method, which are described in detail below with reference to the unlabeled sensor data 2, with the difference that in the seventh step an output of an output layer of the autoencoder 4 is no longer used, but the compressed data is further processed.

In einem vierten Schritt S4 des Verfahrens erfolgt ein Erzeugen eines dreidimensionalen Umfeldmodells 5 basierend auf den ungelabelten Sensordaten 2 mittels des (initial) trainierten KI-Modells 3.In a fourth step S4 of the method, a three-dimensional environment model 5 is generated based on the unlabeled sensor data 2 by means of the (initially) trained AI model 3.

Anschließend erfolgt ein Erzeugen von Gitternetz-Repräsentationen 6des basierend auf den ungelabelten Sensordaten 2 mittels des trainierten KI-Modells 3 erzeugten Umfeldmodells 5.Subsequently, grid representations 6 of the environment model 5 generated based on the unlabeled sensor data 2 using the trained AI model 3 are generated.

Dazu erfolgt in einem fünften Schritt S5 des Verfahrens ein Erzeugen einer zweidimensionalen Vogelperspektiven-Ansicht 7 basierend auf dem dreidimensionalen Umfeldmodell 5 und in einem sechsten Schritt S6 des Verfahrens ein Ableiten der Gitternetz-Repräsentationen 6 aus der erzeugten zweidimensionalen Vogelperspektiven-Ansicht 7. Letzteres umfasst ein Ableiten eines Belegungs-Gitternetzes, das Informationen über eine Belegung einzelner das Belegungs-Gitternetz bildender Rechtecke basierend auf der Vogelperspektiven-Ansicht 7, und ein Ableiten jeweils eines Geschwindigkeits-Gitternetzes und eines Beschleunigungs-Gitternetzes für ein X-Richtung und eine Y-Richtung basierend auf der Vogelperspektiven-Ansicht und des Belegungs-Gitternetzes. Die Gitternetz-Repräsentationen 6 werden dabei für das sich über die Zeit verändernde Umfeldmodell 5 bzw. dessen Vogelperspektiven-Ansicht 7 in zeitlich gleich bleibenden Abständen t, t+1 ... t+n abgeleitet.For this purpose, in a fifth step S5 of the method, a two-dimensional bird's eye view 7 is generated based on the three-dimensional environment model 5, and in a sixth step S6 of the method, the grid representations 6 are derived from the generated two-dimensional bird's eye view 7. The latter includes deriving an occupancy grid that contains information about the occupancy of individual rectangles forming the occupancy grid based on the bird's eye view 7, and deriving a speed grid and an acceleration grid for an X direction and a Y direction based on the bird's eye view and the occupancy grid. The grid representations 6 are derived for the environment model 5, which changes over time, or its bird's eye view 7, at constant intervals t, t+1 ... t+n.

In einem siebten Schritt S7 werden die abgeleiteten Gitternetz-Repräsentationen 6 in den (initial) trainierten Autoencoder 4 eingegeben, um komprimierten Repräsentationen der Gitternetz-Repräsentationen 6 mittels des trainierten Autoencoders 4 zu erzeugen. Mit anderen Worten, die Gitternetz-Repräsentationen 6 werden mittels des Autoencoders 4 in den latenten Raum (engl. latent space) überführt. Ein latenter Raum, der auch als latenter Merkmalsraum oder Einbettungsraum bezeichnet wird, ist eine Einbettung einer Menge von Objekten in eine Struktur, in der Objekte, die einander ähneln, im latenten Raum näher beieinander liegen. Die Position innerhalb des latenten Raums kann als durch eine Reihe latenter Variablen definiert angesehen werden, die sich aus den Ähnlichkeiten des Objekts ergeben. Der Autoencoder 4 ist vorliegend als ein Raum-Zeit-Autoencoder ausgeführt, der ausgestaltet ist, um aus den Belegungs-Gitternetzen räumliche Informationen zu extrahieren und aus den Geschwindigkeits-Gitternetzen und den Beschleunigungs-Gitternetzen zeitliche Informationen zu extrahieren.In a seventh step S7, the derived grid representations 6 are input into the (initially) trained autoencoder 4 in order to generate compressed representations of the grid representations 6 by means of the trained autoencoder 4. In other words, the grid representations 6 are transferred to the latent space by means of the autoencoder 4. A latent space, also referred to as a latent feature space or embedding space, is an embedding of a set of objects in a structure in which objects that are similar to one another are closer to one another in the latent space. The position within the latent space can be viewed as being defined by a series of latent variables that result from the similarities of the object. The autoencoder 4 is embodied as a space-time autoencoder which is designed to extract spatial information from the occupancy grids and to extract temporal information from the velocity grids and the acceleration grids.

In einem achten Schritt S8 erfolgt ein Durchführen einer Kernsatzauswahl. Diese kann basierend auf den komprimierten Repräsentationen durchgeführt werden, um so mehrere bzw. ein Set von Sensordaten aus den ungelabelten Sensordaten zum Labeln für ein weiteres Trainieren des KI-Modells 3 auszuwählen. Anschließend kann das Verfahren von neuem gestartet werden bzw. eine neue Iteration durchgeführt werden, wobei die ausgewählten ungelabelten Sensordaten 2 gelabelt werden und zu dem Pool von gelabelten Sensordaten 1 vor dem Beginn des weiteren Durchgangs hinzugefügt werden.In an eighth step S8, a core set selection is carried out. This can be carried out based on the compressed representations in order to select several or a set of sensor data from the unlabeled sensor data for labeling for further training of the AI model 3. The process can then be started again or a new iteration can be carried out, whereby the selected unlabeled sensor data 2 is labeled and added to the pool of labeled sensor data 1 before the start of the next pass.

BezugszeichenlisteList of reference symbols

11: gelabelte Sensordatenlabeled sensor data
22: ungelabelte Sensordatenunlabeled sensor data
33: KI-ModellAI model
44: AutoencoderAutoencoder
55: UmfeldmodellEnvironment model
66: Vogelperspektiven-Ansicht des UmfeldmodellsBird's eye view of the environment model
77: Gitternetz-Repräsentationen Grid representations
S1 - S8S1 - S8: VerfahrensschritteProcess steps

Claims

Computer-implemented method for training a model based on artificial intelligence (AI model) (3) for creating an environment model (5) for an automated motor vehicle, the method comprising: - providing (S1) a training data set comprising labeled and unlabeled sensor data (1, 2) from environment sensors of the motor vehicle, - training (S2) the AI model (3) with the labeled sensor data (1), characterized in that the method comprises: - training (S3) an autoencoder (4) with a grid representation generated based on the labeled sensor data (1) by means of the trained AI model (3), - generating (S4) an environment model (5) based on the unlabeled sensor data (2) by means of the trained AI model (3), - generating grid representations (7) of the environment model (5) generated based on the unlabeled sensor data (2) by means of the trained AI model (3), - generating (S7) compressed representations of the grid representations (6) by means of the trained autoencoder (4), and - performing (S8) a core set selection based on the compressed representations in order to select at least one set of sensor data for labeling for further training of the AI model (3) from the unlabeled sensor data (2).

Computer-implemented method according to Claim 1 , characterized in that the environmental sensors have a LiDAR sensor and the labeled and unlabeled sensor data (1, 2) were at least partially recorded by the LiDAR sensor.

Computer-implemented method according to Claim 2 , characterized in that the generation of the grid representations (6) comprises: - generating (S5) a two-dimensional bird's eye view (7) based on the environment model (5), which is present as a three-dimensional model after its generation by the AI model (3), and - deriving (S6) the grid representations (6) from the generated two-dimensional bird's eye view (7).

Computer-implemented method according to Claim 3 , characterized in that deriving the grid representations (6) from the generated two-dimensional bird's eye view (7) comprises for each grid representation (6): - deriving an occupancy grid which contains information about an occupancy of individual rectangles forming the occupancy grid based on the bird's eye view (7), and - deriving a speed grid and an acceleration grid for an X direction and a Y direction based on the bird's eye view (7) and the occupancy grid.

Computer-implemented method according to Claim 4 , characterized in that the autoencoder (4) comprises a space-time autoencoder which is designed to extract spatial information from the occupancy grid and to extract temporal information from the velocity grid and the acceleration grid.

Computer-implemented method according to one of the Claims 1 until 5 , characterized in that the sensor data were recorded continuously during a journey with the motor vehicle.

Device for data processing, characterized in that the device is designed to carry out the method according to one of the Claims 1 until 6 to execute.

Computer program, characterized in that the computer program comprises instructions which, when the program is executed by a computer, cause the computer to carry out the method according to one of the Claims 1 until 6 to execute.

Computer-readable medium, characterized in that the computer-readable medium comprises instructions which, when executed by a computer, cause the computer to carry out the method according to one of the Claims 1 until 6 to execute.

Use of a device manufactured by the method according to any of the Claims 1 until 6 trained AI model (3) in the context of automated driving of an automated motor vehicle.