DCAT parsing issue on carriage returns #149

Open
stijnvanhoey opened this issue Oct 23, 2017 · 2 comments

@stijnvanhoey
Contributor

After the adaptations to the IPT, we still have issues with carriage returns in the description fields. The cause is that the 'next paragraph' option has to be used whenever a new paragraph (basically any Enter) is needed.

This causes parsing issues when the DCAT feed is read by Open Data Vlaanderen (and Belgium), so our datasets do not appear in the listing.

The DCAT feed output currently contains these errors for the datasets PINK (amphibia), Dung beetles and Alien macro-invertebrates.
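
To illustrate why a raw line break can break a parser, here is a minimal sketch. It assumes the feed is serialized as Turtle, where a literal in plain double quotes may not contain a raw carriage return or newline and needs an escape instead. The helper name `escape_turtle_literal` is hypothetical and is not part of the IPT.

```python
def escape_turtle_literal(text: str) -> str:
    """Escape a string for use inside a double-quoted Turtle literal.

    Hypothetical helper for illustration only, not IPT code.
    """
    replacements = [
        ("\\", "\\\\"),  # backslash first, so later escapes are not doubled
        ('"', '\\"'),    # double quote would otherwise end the literal
        ("\r", "\\r"),   # carriage return
        ("\n", "\\n"),   # newline
        ("\t", "\\t"),   # tab
    ]
    for raw, escaped in replacements:
        text = text.replace(raw, escaped)
    return text


# A description with a paragraph break typed as a plain Enter.
description = "First paragraph.\r\nSecond paragraph."
print('dct:description "' + escape_turtle_literal(description) + '"@en ;')
```

After escaping, the whole literal stays on a single line, which is what a Turtle parser expects for a literal in plain double quotes.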

We can check that the DCAT feed works correctly with http://id.vlaanderen.be/vodap_validator
@DimEvil @milotictanja @peterdesmet @LienReyserhove
Whenever we add new datasets, let's do a quick check by pasting the URL of the INBO DCAT feed (http://data.inbo.be/ipt/dcat) into the validator.

@stijnvanhoey
Contributor Author

stijnvanhoey commented Oct 23, 2017

I adapted the metadata of the datasets with the paragraph issue and we no longer have any parsing issues. The result of the vodap_validator is: http://id.vlaanderen.be/vodap_validatorresults/2017-10-23T11:14:21Z/

The parsing issues are solved, but we still have some other errors in the report:

  1. Catalog - error - dct:issued should be a literal typed as date or dateTime.
  2. Catalog - error - dct:modified should be a literal typed as date or dateTime.

See xsd:dateTime for the required format.
It is a matter of explicitly typing the value: "2012-09-17T17:01:00+02:00"^^xsd:dateTime (the seconds are required) or
"1999-08-16"^^xsd:date. In other words, the datatype has to be added to the date field, see also https://www.infowebml.ws/rdf-owl/Literal.htm

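As a rough sketch of what "explicitly typing the value" could look like, the snippet below formats Python date values as typed literals in Turtle-style notation. The function name `typed_literal` is hypothetical, and the snippet assumes the `xsd:` prefix is declared elsewhere in the feed; the actual fix would have to be made in the IPT's (Java) DCAT output. Note that the xsd:dateTime lexical form includes seconds.

```python
from datetime import date, datetime, timedelta, timezone


def typed_literal(value):
    """Return a date or datetime as an explicitly typed literal (hypothetical helper)."""
    # datetime is a subclass of date, so it has to be tested first.
    if isinstance(value, datetime):
        lexical = value.isoformat(timespec="seconds")
        return '"' + lexical + '"^^xsd:dateTime'
    if isinstance(value, date):
        return '"' + value.isoformat() + '"^^xsd:date'
    raise TypeError("expected a date or datetime")


# Example values taken from the comment above, with a +02:00 offset.
modified = datetime(2012, 9, 17, 17, 1, tzinfo=timezone(timedelta(hours=2)))
issued = date(1999, 8, 16)

print("dct:modified " + typed_literal(modified) + " ;")
print("dct:issued " + typed_literal(issued) + " ;")
```
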
and some warnings:

  1. License Document - warning - dct:LicenseDocument does not exist.
  2. Agent - warning - dct:type is a recommended property for Agent.

@peterdesmet we should probably tackle these on the IPT implementation level?

@peterdesmet
Member

Yes, we should tackle these on an IPT implementation level. It might be an interesting one-day project for a Java dev... or we could try it ourselves.
