Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

recordedBy values contain HTML content #522

Open
nickdos opened this issue Feb 25, 2022 · 0 comments
Open

recordedBy values contain HTML content #522

nickdos opened this issue Feb 25, 2022 · 0 comments
Labels
bug Something isn't working

Comments

@nickdos
Copy link

nickdos commented Feb 25, 2022

Related to #521

Viewing facet results for recordedBy field for Questagame records show that we are indexing HTML content in this field, which results in unexpected output when sorting by value (vs count). The IU is stripping out HTML so the user is only seeing the text portion but the ordering of results is still seeing the HTML content and thus all Questagame collector names appear jumbled up together because they all start with <a href.

{
  "i18nCode": "collector.<a href='https://bee.questagame.com/#/profile/12478?questagame_user_id=12478'>frond | questagame.com</a>",
  "count": 41,
  "label": "<a href='https://bee.questagame.com/#/profile/12478?questagame_user_id=12478'>frond | questagame.com</a>",
  "fq": "collector:\"<a href='https://bee.questagame.com/#/profile/12478?questagame_user_id=12478'>frond | questagame.com</a>\""
}

Fix is to strip out HTML using a SOLR filter or via code (SOLR probably has this functionality built-in so suggest not reinventing the wheel here) for the recordedBy field only and rely on the raw_recordedBy to display the HTML version if required.

@nickdos nickdos added the bug Something isn't working label Feb 25, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant