Move away from using evaluate #7

Open
BramVanroy opened this issue Jun 18, 2023 · 0 comments
Labels
enhancement (New feature or request), help wanted (Extra attention is needed)

Comments


BramVanroy commented Jun 18, 2023

For TER, ChrF, and BLEU I use sacrebleu, which works well. For the other metrics I use evaluate. That would work well too, were it not that maintenance of the library has (understandably) been lacking over the last few months, as priorities at Hugging Face shifted with the massive increase in interest in generative models.

For MATEO this is becoming problematic, though. As an example: evaluate still uses COMET v1.1.3. A PR was submitted to fix this (huggingface/evaluate#443), but it has not been merged. The last release of evaluate dates from December 13, 2022, which suggests that we cannot currently rely on it for quick updates or new metrics.

Moving away from evaluate gives more freedom, but it also means more manual implementation work. Staying with evaluate, on the other hand, keeps it relatively easy to add any metric that evaluate already implements.
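To make the "manual implementation work" a bit more concrete, here is a minimal sketch of a metric registry that calls metric libraries directly instead of going through evaluate. This is not how MATEO is implemented: `METRICS`, `register`, `compute`, and the toy `exact_match` metric are hypothetical names made up for illustration. The only real library call is `sacrebleu.corpus_bleu`, and it assumes one reference per hypothesis.

```python
from typing import Callable, Dict, List

# A metric takes hypotheses and references (one reference per hypothesis)
# and returns a single corpus-level score.
MetricFn = Callable[[List[str], List[str]], float]

# Hypothetical registry; not part of MATEO, evaluate, or sacrebleu.
METRICS: Dict[str, MetricFn] = {}


def register(name: str) -> Callable[[MetricFn], MetricFn]:
    """Decorator that stores a metric function in the registry under `name`."""
    def decorator(fn: MetricFn) -> MetricFn:
        METRICS[name] = fn
        return fn
    return decorator


@register("bleu")
def bleu(hypotheses: List[str], references: List[str]) -> float:
    # Imported lazily so the registry has no hard dependency on sacrebleu.
    import sacrebleu
    # sacrebleu expects a list of reference streams, hence the extra list.
    return sacrebleu.corpus_bleu(hypotheses, [references]).score


@register("exact_match")
def exact_match(hypotheses: List[str], references: List[str]) -> float:
    # Toy metric, included only to show how a hand-written metric plugs in.
    if len(hypotheses) != len(references):
        raise ValueError("hypotheses and references must have the same length")
    if not hypotheses:
        return 0.0
    matches = sum(h == r for h, r in zip(hypotheses, references))
    return matches / len(hypotheses)


def compute(name: str, hypotheses: List[str], references: List[str]) -> float:
    """Look up a metric by name and run it."""
    if name not in METRICS:
        raise KeyError(f"Unknown metric: {name}")
    return METRICS[name](hypotheses, references)
```

With something like this, each new metric (or a newer COMET release) becomes one more registered wrapper. The flip side is that every wrapper then has to be written and maintained here, which is exactly the extra work mentioned above.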

This is a low-priority issue. It serves as a reminder to myself and as a starting point for a discussion of the pros and cons.

BramVanroy added the enhancement and help wanted labels Jun 18, 2023