IP-Coster | WO2025248476 | EVENT-BASED REINFORCEMENT LEARNING FOR RRM PARAMETER OPTIMIZATION

Publication Number WO/2025/248476

Publication Date 04.12.2025

International Application No. PCT/IB2025/055537

International Filing Date 28.05.2025

Title **

[English] EVENT-BASED REINFORCEMENT LEARNING FOR RRM PARAMETER OPTIMIZATION

[French] APPRENTISSAGE PAR RENFORCEMENT BASÉ SUR DES ÉVÉNEMENTS POUR OPTIMISATION DE PARAMÈTRES RRM

Applicants **

NOKIA TECHNOLOGIES OY

Inventors

SONG, Jian

FEKI, Afef

HÖHNE, Hans Thomas

ALI-TOLPPA, Janne

VEIJALAINEN, Teemu Mikael

DOSTI, Endrit

ALI, Samad

KHATIBI, Sina

Priority Data

20245701 31.05.2024 FI

Application details

Total Number of Claims/PCT	*
Number of Independent Claims	*
Number of Priorities	*
Number of Multi-Dependent Claims	*
Number of Drawings	*
Pages for Publication	*
Number of Pages with Drawings	*
Pages of Specification	*
Sequence Listing	*
Number of Office Actions	*
International Search Report is established	*
International Searching Authority	EPO *
Recordal of a Change of the Applicant's Name/Address	Change of Applicant's Name and Address *
Type of Assignment	The Standard Agent's Assignment *
Applicant's Legal Status	Legal Entity *
Small Entity	*
Non-Commercial Organization	*
Micro Entity	*
Small Entity, USA	*
Micro Entity, USA	*
Entry into National Phase under	Chapter I *
Patent Delivery	Send the Letters Patent by Courier *
Translation

* The data is based on automatic recognition. Please verify and amend if necessary.

** IP-Coster compiles data from publicly available sources. If this data includes your personal information, you can contact us to request its removal.

Quotation for National Phase entry

Country	Stages	Total
China	Filing, Examination, Granting	2361
EPO	Filing, Examination, Granting	9431
Japan	Filing, Examination, Granting	2183
South Korea	Filing, Examination, Granting	2020
USA	Filing, Examination, Granting	4740

Total: 20,735

Contact Us

Abstract[English] According to an aspect, there is provided an apparatus for performing the following. The apparatus transmits, to a network entity, a configuration request requesting configuration of one or more reinforcement learning, RL, strategies for an RL model. The apparatus receives, from the network entity, at least one configuration message comprising the one or more RL strategies which comprise one or more RL event conditions for entering and/or exiting exploration and/or exploitation events. The apparatus performs exploration and/or exploitation using the RL model. The apparatus evaluates the one or more RL event conditions and transmits, to the network entity, an evaluation report comprising results of the evaluating. The apparatus receives, from the network entity, a positive or negative acknowledgement. Based on the reception of the positive acknowledgment and the results, the apparatus performs triggering an exploration or exploitation event and/or exiting an exploration or the exploitation event.[French] Selon un aspect, l'invention concerne un appareil pour réaliser ce qui suit. L'appareil transmet, à une entité de réseau, une demande de configuration demandant la configuration d'une ou de plusieurs stratégies d'apprentissage par renforcement (RL) pour un modèle RL. L'appareil reçoit, en provenance de l'entité de réseau, au moins un message de configuration comprenant la ou les stratégies RL qui comprennent une ou plusieurs conditions d'événement RL pour entrer et/ou sortir des événements d'exploration et/ou d'exploitation. L'appareil effectue une exploration et/ou une exploitation à l'aide du modèle RL. L'appareil évalue la ou les conditions d'événement RL et transmet, à l'entité de réseau, un rapport d'évaluation comprenant des résultats de l'évaluation. L'appareil reçoit, en provenance de l'entité de réseau, un accusé de réception positif ou négatif. Sur la base de la réception de l'accusé de réception positif et des résultats, l'appareil effectue le déclenchement d'un événement d'exploration ou d'exploitation et/ou la sortie d'une exploration ou de l'événement d'exploitation.

WO2025248476 - EVENT-BASED REINFORCEMENT LEARNING FOR RRM PARAMETER OPTIMIZATION

Quotation for National Phase entry

Contact Us