WO2024246642 - TRACKING CONTENT WITH ARTIFICAL INTELLIGENCE AS IT IS CONSUMED

National phase entry is expected:
Publication Number WO/2024/246642
Publication Date 05.12.2024
International Application No. PCT/IB2024/054533
International Filing Date 09.05.2024
Title **
[English] TRACKING CONTENT WITH ARTIFICAL INTELLIGENCE AS IT IS CONSUMED
[French] SUIVI DE CONTENU AVEC INTELLIGENCE ARTIFICIELLE AU FUR ET À MESURE DE SA CONSOMMATION
Applicants **
SONY GROUP CORPORATION 1-7-1 Konan Minato-ku Tokyo 108-0075, JP
Inventors
CANDELORE, Brant 16535 Via Esprillo San Diego, California 92127, US
Priority Data
18/328,533   02.06.2023   US

** IP-Coster compiles data from publicly available sources. If this data includes your personal information, you can contact us to request its removal.

Quotation for National Phase entry

Country       Stages                 Total
China         Filing                 1243
EPO           Filing, Examination    6281
Japan         Filing                 594
South Korea   Filing                 575
USA           Filing, Examination    2710

Total: 11403

The term for entry into the National Phase has expired. This quotation is for informational purposes only.

Abstract
[English] A user starts playback of an audio or AV program, which starts up a large language model (LLM) (210, 220, 304, 404, 500) such as a generative pre-trained transformer. The LLM follows the program with the user as the program is being watched. Audio is converted to text and video is converted to text-based description. The LLM essentially trains on the resulting "corpus" so that should the user subsequently want to access part of the program using a simple conversational query about the program, the LLM can provide the answer.
[French] L'invention se rapporte à un utilisateur qui démarre la lecture d'un programme audio ou audiovisuel, qui démarre un grand modèle de langage (LLM) (210, 220, 304, 404, 500) tel qu'un transformateur pré-entraîné génératif. Le LLM suit le programme avec l'utilisateur lorsque le programme est regardé. Un audio est converti en texte et une vidéo est convertie en une description à base de texte. Le LLM s'entraîne essentiellement le « corpus » résultant, de sorte que si l'utilisateur souhaite ultérieurement accéder à une partie du programme au moyen d'une simple interrogation conversationnelle concernant le programme, le LLM puisse fournir la réponse.
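The flow described in the abstract (program text accumulated during playback, then queried conversationally) can be illustrated with a minimal Python sketch. This is not the method claimed in the application: the names `Segment`, `ProgramCorpus`, `add` and `query` are hypothetical, the speech-to-text and video-description steps are assumed to have already produced the strings, and a simple keyword-overlap lookup stands in for the LLM.

```python
import re
from dataclasses import dataclass, field

# Small stopword list so common words do not dominate the match (illustrative only).
STOPWORDS = {"the", "a", "an", "and", "did", "when", "was", "is", "to", "of", "by"}


def _words(text):
    """Lowercase the text and return its set of non-stopword tokens."""
    return set(re.findall(r"[a-z0-9']+", text.lower())) - STOPWORDS


@dataclass
class Segment:
    start_s: float  # playback position, in seconds, where this text begins
    source: str     # "audio" (transcript) or "video" (scene description)
    text: str


@dataclass
class ProgramCorpus:
    """Hypothetical running text corpus built while a program plays."""
    segments: list = field(default_factory=list)

    def add(self, start_s, source, text):
        # Assumption: text was already produced by some speech-to-text
        # or video-captioning step that is outside this sketch.
        self.segments.append(Segment(start_s, source, text))

    def query(self, question):
        """Return the segment sharing the most keywords with the question, or None."""
        q_words = _words(question)
        best, best_score = None, 0
        for seg in self.segments:
            score = len(q_words & _words(seg.text))
            if score > best_score:
                best, best_score = seg, score
        return best


corpus = ProgramCorpus()
corpus.add(12.0, "audio", "Welcome back to the harbor town.")
corpus.add(95.5, "video", "A detective opens a drawer and finds a sealed letter.")
corpus.add(140.0, "audio", "The letter was written by her brother.")

hit = corpus.query("When did the detective find the letter?")
print(hit.start_s if hit else "no match")
```

Keeping a playback timestamp with each text segment is what would let an answer point back to a part of the program; a real system would replace the keyword lookup with the language model the abstract describes.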