WO2023031879 - HIERARCHICAL CLUSTERING ON GRAPHS FOR TAXONOMY EXTRACTION AND APPLICATIONS THEREOF
National phase entry:
Publication Number
WO/2023/031879
Publication Date
09.03.2023
International Application No.
PCT/IB2022/058284
International Filing Date
02.09.2022
Title **
[English]
HIERARCHICAL CLUSTERING ON GRAPHS FOR TAXONOMY EXTRACTION AND APPLICATIONS THEREOF
[French]
REGROUPEMENT HIÉRARCHIQUE SUR DES GRAPHES POUR L'EXTRACTION DE TAXONOMIE ET SES APPLICATIONS
Applicants **
THOMSON REUTERS ENTERPRISE CENTRE GMBH
Landis + Gyr-Strasse 3
6300 Zug, CH
Inventors
NEFEDOV, Nikolai
Schuetzenhausstrasse 13
8912 Obfelden, CH
VON RICKENBACH, David
Grabenstrasse 3
6340 Baar, CH
Priority Data
63/240,393
03.09.2021
US
17/901,648
01.09.2022
US
Application details
| Total Number of Claims/PCT | * |
| Number of Independent Claims | * |
| Number of Priorities | * |
| Number of Multi-Dependent Claims | * |
| Number of Drawings | * |
| Pages for Publication | * |
| Number of Pages with Drawings | * |
| Pages of Specification | * |
| * | |
| * | |
International Searching Authority |
EPO
* |
| Applicant's Legal Status |
Legal Entity
* |
| * | |
| * | |
| * | |
| * | |
| Entry into National Phase under |
Chapter I
* |
| Translation |
|
Recalculate
* The data is based on automatic recognition. Please verify and amend if necessary.
** IP-Coster compiles data from publicly available sources. If this data includes your personal information, you can contact us to request its removal.
Quotation for National Phase entry
| Country | Stages | Total | |
|---|---|---|---|
| China | Filing | 1460 | |
| EPO | Filing, Examination | 6688 | |
| Japan | Filing | 594 | |
| South Korea | Filing | 608 | |
| USA | Filing, Examination | 3310 |

Total: 12660 USD
The term for entry into the National Phase has expired. This quotation is for informational purposes only
Abstract[English]
Aspects of the present disclosure provide systems, methods, apparatus, and computer-readable storage media for extracting taxonomies based on hierarchical clustering on graphs related to a corpus of documents and using said taxonomies for classifying and labeling documents. The hierarchical clustering of graphs include the adaptive pruning of nodes at each hierarchy based on betweenness centrality of nodes to form clusters that have modularity score exceeding a minimum modularity threshold.[French]
Selon certains aspects, la présente invention concerne des systèmes, des procédés, un appareil et des supports de stockage lisibles par ordinateur pour extraire des taxonomies sur la base d'un regroupement hiérarchique sur des graphes associés à un corpus de documents et utiliser lesdites taxonomies afin de classifier et d'étiqueter des documents. Le regroupement hiérarchique de graphes comprend l'élagage adaptatif de nœuds au niveau de chaque hiérarchie sur la base de la centralité d'interdépendance des nœuds pour former des groupes dont le score de modularité dépasse un seuil de modularité minimal.