What is Open Data Certificate?
Open Data Certificate is a free online tool developed and maintained by the Open Data Institute, to assess and recognise the sustainable publication of quality open data.
Why is Open Data Certificate useful?
We are working to build trust in open data to support its reuse. Open Data Certificate is an independent quality mark benchmarked against a set of tried and tested standards, to professionalise open data publication. It covers the legal, practical, technical and social requirements to support the use of sustainable open data.
How does Open Data Certificate work?
A questionnaire for data publishers assesses the level of support and steps taken to make data reusable and discoverable. Depending on the answers, we award a badge that can be embedded on the publisher’s website to signify the quality level of their publication process to reusers.
How is Open Data Certificate different to existing standards such as opendefinition.org, 5* of Open Data, Sunlight principles, DCAT etc?
Open Data Certificate builds upon these existing standards. It provides a clear assessment of how well the data meets best practices, and provides data publishers with guidance on how to improve their data for optimal reuse.
What are the benefits of Open Data Certificate to publishers?
- To be recognised for your open data publishing achievements
- To engage with the reuser community
- To build trust and sustainability in the use of open data
- To be guided through best publishing practices
- To acquire a free report on how to improve open data publishing
What are the benefits of Open Data Certificate to reusers?
Level of dependability - we ask publishers key questions about copyright, timeliness, how long this dataset will be supported and updated as well as encouraging thorough documentation.
Adherence to best practices - Higher certificate levels indicate greater commitments from publisher’s to ensure their data is reliable, helping re-users identify quality open data.
Independent assessment - Open Data Certificate is independent of any particular data publishing platform or metadata standard. We combine global best practices to ensure that data is published sustainably with the information needed for reuse.
Can certificates be automatically generated?
If you’re publishing open data in a machine readable way (for example, by using data portal software like ckan or OpenDataSoft) we can pre-fill many of the certificate questions. Simply get in touch with us to get started.
We recommend that you still review and add any additional information about your communication and privacy practices along with other documentation.
Why has my data only achieved level [bronze / silver / gold]? We’ve been publishing data for years!
A Bronze level badge is a fantastic achievement - it means your data is re-use ready! However, we want to encourage publishers to aim higher and push for better internal publishing practices. You do not need to aim for Gold and Platinum for all of your datasets.
What level should my data be achieving?
If the certificate is for a core dataset - produced as part of the main function of your organisation - then we recommend you aim for Gold or Platinum level. This is to reflect the level of investment your organisation has made for this data and communicate to reusers its level of reliability. If this is a singular release, Bronze is perfectly fine. Silver is the general level for great quality open data publication.
Who is using Open Data Certificate?
More and more governments and organisations from around the world are using Open Data Certificates every day. Current users include:
- UK national government (data.gov.uk)
- Greater London Authority (data.london.gov.uk)
- Australia Queensland (data.qld.gov.au)
- Office of National Statistics
- UK Data Service
What does the Open Data Certificate cover? Does it tell me about the quality of the data itself?
Open Data Certificate primarily assesses metadata and the surrounding publication process (privacy, copyright, communication, whether the data will be regularly updated and how). Currently there is no direct validation of the data itself although we assess whether appropriate formats are used.
In the future we may integrate with tools such as CSVlint to help validate data against a schema and judge its quality.
How reliable are these certificates if they are self certified?
Many of the steps have built in automatic validation (for example, testing links) and we have support for community validation - external members can flag issues and there is an independent process to check and update certificates in response to any concerns.
Going forward, we will offer independent ODI auditing for certification to provide the highest standard of trust.
How do you ensure the legal aspects are relevant across different jurisdictions?
We have a localisation team that works with governments and NGOs in different jurisdictions to adapt the Open Data Certificate questionnaire. Certificate localisations reflect different legal defaults and standards used by publishers.
How do you demonstrate that the data has been certified?
As a publisher, once you have completed the certification process (or reviewed and claimed an automatically prefilled certificate) and awarded a badge, you can embed this in your dataset or webpage that links to the full certificate. We have technical support and tools to help you do this throughout the process.
I already use data portal software, can certificates be automatically generated from it?
By using a portal, much of the information can be automatically prefilled. We are working with all major portal software providers to provide this seamlessly.
Open Data Certificate is an independent standard which works across portal providers and captures key additional information about the re-usability quality of your datasets, such as the level of comms support, update frequency and X. We recommend that you still review and add answers after the automated prefilling process.
Do certificates go out of date?
Certificates show the certification date so re-users can judge when this was last updated. Unlike portal metadata, our certificates go out of date and need to be renewed and updated.
We plan to track the currentness of certificates and communicate when they need to be updated. For example, if the dataset has changed, the certificate will no longer apply until it is updated, or if we know from the questionnaire that the certificate is due to expire within a year, the certificate will reflect this.
What is auto-certification?
A free tool that allows any data publishers, aggregators or portal owners to quickly and easily create Open Data Certificates for numerous datasets in one go.
We use existing metadata stored in a data portal to fill in as many certificate fields as possible, ensuring that publishers can focus on adding extra information not already stored.
Auto-certification currently supports metadata stored in CKAN portals or sites with metadata in DCAT. The auto-certification process only creates certificates at Bronze level. Please get in touch if you would like your data portal auto-certified.