IP-Coster | WO2024246642 | TRACKING CONTENT WITH ARTIFICAL INTELLIGENCE AS IT IS CONSUMED

Publication Number WO/2024/246642

Publication Date 05.12.2024

International Application No. PCT/IB2024/054533

International Filing Date 09.05.2024

Title **

[English] TRACKING CONTENT WITH ARTIFICAL INTELLIGENCE AS IT IS CONSUMED

[French] SUIVI DE CONTENU AVEC INTELLIGENCE ARTIFICIELLE AU FUR ET À MESURE DE SA CONSOMMATION

Applicants **

SONY GROUP CORPORATION 1-7-1 Konan Minato-ku Tokyo 108-0075, JP

Inventors

CANDELORE, Brant 16535 Via Esprillo San Diego, California 92127, US

Priority Data

18/328,533 02.06.2023 US

Application details

Total Number of Claims/PCT	*
Number of Independent Claims	*
Number of Priorities	*
Number of Multi-Dependent Claims	*
Number of Drawings	*
Pages for Publication	*
Number of Pages with Drawings	*
Pages of Specification	*
Sequence Listing	*
International Search Report is established	*
International Searching Authority	EPO *
Applicant's Legal Status	Legal Entity *
Small Entity	*
Non-Commercial Organization	*
Small Entity, USA	*
Micro Entity, USA	*
Entry into National Phase under	Chapter I *
Translation

* The data is based on automatic recognition. Please verify and amend if necessary.

** IP-Coster compiles data from publicly available sources. If this data includes your personal information, you can contact us to request its removal.

Quotation for National Phase entry

Country	Stages	Total
China	Filing	1264
EPO	Filing, Examination	6260
Japan	Filing	589
South Korea	Filing	573
USA	Filing, Examination	2710

Total: 11,396 USD

The term for entry into the National Phase has expired. This quotation is for informational purposes only

Abstract[English] A user starts playback of an audio or AV program, which starts up a large language model (LLM) (210, 220, 304, 404, 500) such as a generative pre-trained transformer. The LLM follows the program with the user as the program is being watched. Audio is converted to text and video is converted to text-based description. The LLM essentially trains on the resulting "corpus" so that should the user subsequently want to access part of the program using a simple conversational query about the program, the LLM can provide the answer.[French] L'invention se rapporte à un utilisateur qui démarre la lecture d'un programme audio ou audiovisuel, qui démarre un grand modèle de langage (LLM) (210, 220, 304, 404, 500) tel qu'un transformateur pré-entraîné génératif. Le LLM suit le programme avec l'utilisateur lorsque le programme est regardé. Un audio est converti en texte et une vidéo est convertie en une description à base de texte. Le LLM s'entraîne essentiellement le « corpus » résultant, de sorte que si l'utilisateur souhaite ultérieurement accéder à une partie du programme au moyen d'une simple interrogation conversationnelle concernant le programme, le LLM puisse fournir la réponse.

WO2024246642 - TRACKING CONTENT WITH ARTIFICAL INTELLIGENCE AS IT IS CONSUMED

Quotation for National Phase entry