<article xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en"><front><journal-meta><journal-id journal-id-type="publisher">episciences.org</journal-id><journal-title-group><journal-title>ElPub - ELectronic PUBlishing</journal-title><abbrev-journal-title>ELPUB</abbrev-journal-title></journal-title-group><publisher><publisher-loc><email>support@episciences.org</email><uri>https://www.episciences.org</uri><uri>https://elpub.episciences.org</uri></publisher-loc></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4000/proceedings.elpub.2020.1</article-id><article-id pub-id-type="hal">hal-02544245</article-id><article-id pub-id-type="publisher-id">http://elpub.episciences.org/6307</article-id><article-catgories><series-text content-type="text">Short Papers</series-text></article-catgories><title-group><article-title xml:lang="en">Open science-based framework to reveal open data publishing: an experience from using Common Crawl</article-title></title-group><contrib-group><contrib contrib-type="author"><name><surname>Correa</surname><given-names>Andreiwid</given-names></name><institution-wrap><institution><institution_id type="ror">https://ror.org/005pn5z34</institution_id><institution_name>Federal Institute of São Paulo</institution_name></institution></institution-wrap></contrib><contrib contrib-type="author"><name><surname>Fernandes</surname><given-names>Israel</given-names></name><institution-wrap><institution><institution_id type="ror">https://ror.org/005pn5z34</institution_id><institution_name>Federal Institute of São Paulo</institution_name></institution></institution-wrap></contrib></contrib-group><pub-date pub-type="epub"><day>18</day><month>04</month><year>2020</year></pub-date><volume>Charting The Futures(s) of Digital Publishing</volume><uri specific-use="for-review">http://elpub.episciences.org/6307/pdf</uri><self-uri>http://elpub.episciences.org/6307</self-uri><abstract xml:lang="en"><p>The publishing of open data is considered a key element for civic participation paving the way tothe ‘public value’, a term which underpins the social contribution. A result of that can be seenthrough the popularity of data portals published all around the world by governments, publicand private organizations. However, the diffusion of data portals raises concerns aboutdiscoverability and validity of these data sources, especially to what extent they contribute toopen data and open science. The purpose of this work is to develop a framework to reveal opendata publishing with the use of a freely available open science project called Common Crawl. Theidea is to identify open data-related initiatives and to gather information about their availability,having in the framework’s essence an iterative and differential process. The main outcome isshown through a proposed model for the historical data repository which involves both use andcreation of open science to branch new sort of research possibilities based on publishing ofderived data.</p></abstract><kwd-group kwd-group-type="author" xml:lang="en"><kwd>open data</kwd><kwd>open science</kwd><kwd>common crawl</kwd><kwd>data portals</kwd><kwd>[SHS.INFO]Humanities and Social Sciences/Library and information sciences</kwd></kwd-group><permissions><copyright-year>2020</copyright-year><copyright-holder>The Author(s)</copyright-holder><license license-type="open-access" xlink:href="https://about.hal.science/hal-authorisation-v1"/></permissions><counts><page-count count="10"/></counts></article-meta></front><body/></article>