IP-Coster | WO2023061465 | METHODS, SYSTEMS, AND MEDIA FOR COMPUTER VISION USING 2D CONVOLUTION OF 4D VIDEO DATA TENSORS

Publication Number WO/2023/061465

Publication Date 20.04.2023

International Application No. PCT/CN2022/125299

International Filing Date 14.10.2022

Title **

[English] METHODS, SYSTEMS, AND MEDIA FOR COMPUTER VISION USING 2D CONVOLUTION OF 4D VIDEO DATA TENSORS

[French] PROCÉDÉS, SYSTÈMES ET SUPPORTS DE VISION ARTIFICIELLE UTILISANT UNE CONVOLUTION 2D DE TENSEURS DE DONNÉES VIDÉO 4D

Applicants **

HUAWEI TECHNOLOGIES CO., LTD.

Inventors

HAJIMOLAHOSEINI, Habib

KUMAR, Kaushal

DENG, Gordon

Priority Data

17/502,588 15.10.2021 US

Application details

Total Number of Claims/PCT	*
Number of Independent Claims	*
Number of Priorities	*
Number of Multi-Dependent Claims	*
Number of Drawings	*
Pages for Publication	*
Number of Pages with Drawings	*
Pages of Specification	*
Sequence Listing	*
Number of Office Actions	*
International Search Report is established	*
International Searching Authority	CNIPA *
Recordal of a Change of the Applicant's Name/Address	Change of Applicant's Name and Address *
Type of Assignment	The Standard Agent's Assignment *
Applicant's Legal Status	Legal Entity *
Small Entity	*
Non-Commercial Organization	*
Micro Entity	*
Small Entity, USA	*
Micro Entity, USA	*
Entry into National Phase under	Chapter I *
Patent Delivery	Send the Letters Patent by Courier *
Translation

* The data is based on automatic recognition. Please verify and amend if necessary.

** IP-Coster compiles data from publicly available sources. If this data includes your personal information, you can contact us to request its removal.

Quotation for National Phase entry

Country	Stages	Total
China	Filing, Examination, Granting	2041
EPO	Filing, Examination, Granting	14728
Japan	Filing, Examination, Granting	2307
South Korea	Filing, Examination, Granting	2463
USA	Filing, Examination, Granting	5340

Total: 26,879

The term for entry into the National Phase has expired. This quotation is for informational purposes only

Contact Us

Abstract[English] Methods, systems and media for computer vision using 2D convolution of 4D video data tensors are described. 3D convolution operations performed on 5D input tensors are simulated by performing 2D convolution of 4D tensors instead. A convolution block of a CNN performs two parallel operations: a spatial processing branch performs spatial feature extraction on a 4D tensor using 2D convolution, whereas a temporal processing branch performs temporal feature extraction on a different 4D tensor using 2D convolution. The output tensors of the spatial processing branch and the temporal processing branch are combined to generate an output tensor of the convolution block. The convolution block may include additional operations such as reshaping and/or further convolution operations to generate identically-sized output tensors for each branch, thereby eliminating the need for post- processing of the branches' output tensors prior to combining them.[French] L'invention concerne des procédés, des systèmes et des supports de vision artificielle utilisant une convolution 2D de tenseurs de données vidéo 4D. Des opérations de convolution 3D effectuées sur des tenseurs d'entrée 5D sont simulées en réalisant à la place une convolution 2D de tenseurs 4D. Un bloc de convolution d'un CNN effectue deux opérations parallèles : une branche de traitement spatial réalise une extraction de caractéristiques spatiales sur un tenseur 4D au moyen d'une convolution 2D, tandis qu'une branche de traitement temporel effectue une extraction de caractéristiques temporelles sur un tenseur 4D différent au moyen d'une convolution 2D. Les tenseurs de sortie de la branche de traitement spatial et de la branche de traitement temporel sont combinés pour générer un tenseur de sortie du bloc de convolution. Le bloc de convolution peut comprendre des opérations supplémentaires telles qu'un remodelage et/ou d'autres opérations de convolution pour générer des tenseurs de sortie de taille identique pour chaque branche, ce qui permet d'éliminer la nécessité d'un post-traitement des tenseurs de sortie des branches avant de les combiner.

WO2023061465 - METHODS, SYSTEMS, AND MEDIA FOR COMPUTER VISION USING 2D CONVOLUTION OF 4D VIDEO DATA TENSORS

Quotation for National Phase entry

Contact Us