Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unicode Error #7

Open
hangerjj opened this issue Apr 19, 2023 · 0 comments
Open

Unicode Error #7

hangerjj opened this issue Apr 19, 2023 · 0 comments

Comments

@hangerjj
Copy link

Has anyone seen this error before? The file has hundreds of thousands of games but I'm getting a Unicode error when running pgn2data. What I've tired so far is the following. Before I manually look at the pgn file with Scid, any other ideas what could be causing this?

iconv-Linux tool to change the encoding but it fails.

pgn-extract-A pgn command line tool to clean pgn files. Still getting the Unicode error.

I thought about creating a python script in attempt to change the encoding but the solutions I researched were working with the read.csv tool in pandas so I thought that'd be incorrect

OS is Debian 11 Bullseye

Example of error. Had multiple position #'s.

UnicodeDecodeError: 'utf-8' codec can't decode byte 0x94 in position 5478: invalid start byte

Example of iconv error. Sequence position has varied.

iconv: illegal input sequence at position 2635110

Example of the code I was using.


from converter.pgn_data import PGNData

pgn_data = PGNData("multiplegames.pgn")
pgn_data.export()
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant