ElPub - ELectronic PUBlishing
Conference paper

Automatic Subject Indexing and Classification Using Text Recognition and Computer-Based Analysis of Tables of Contents

Jan Pokorny (1)
(1) ENKI, o.p.s.
Publication details
Submitted: June 20, 2018
Accepted: June 20, 2018
Published: June 20, 2018
Modified: March 31, 2025
Conference proceedings 1
Connecting the Knowledge Commons: From Projects to Sustainable Infrastructure
Long Papers
DOI: 10.4000/proceedings.elpub.2018.19
License: Attribution 4.0 International (CC BY 4.0)
Metrics
Views: 385
Downloads: 1196

Abstract
This paper describes a method for machine-based creation of high-quality subject indexing and classification for both electronic and print documents using tables of contents (ToCs). The technology is aimed primarily at electronic and print documents whose full text cannot be indexed for technical or licensing reasons. It would also be useful for full-text documents, however, because analyzing the structure of ToCs could significantly improve the accuracy and relevance of subject description.
Keywords
French
  • [SHS.INFO]Humanities and Social Sciences/Library and information sciences
English
  • machine learning system
  • computer-generated keywords
  • library automatization
  • text mining
  • computer-generated subject headings
Cited by

Source: OpenCitations

  • Hierarchical Multi-Label Classification of Library Subject Headings

    2022 International Conference on Cybernetics and Innovations (ICCI)

    Authors: Worrawan Wandee, Pokpong Songmuang

    Journal reference: Volume 8, 2022, pp. 1-5

    DOI: 10.1109/icci54995.2022.9744189
  • Automated Subject Indexing of Domain Specific Collections Using Word Embeddings and General Purpose Thesauri

    Communications in Computer and Information Science

    Authors: Michalis Sfakakis, Leonidas Papachristopoulos, Kyriaki Zoutsou, Giannis Tsakonas, Christos Papatheodorou

    Journal reference: 2019, pp. 103-114

    DOI: 10.1007/978-3-030-36599-8_9