Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error when running fs check #2

Closed
censey opened this issue Jan 25, 2024 · 5 comments
Closed

Error when running fs check #2

censey opened this issue Jan 25, 2024 · 5 comments

Comments

@censey
Copy link

censey commented Jan 25, 2024

Hello,
I am just trying to test the current version on a few documents. This is the outputs I see after the files are processed. Just after "Now, lets look at findings!"

Traceback (most recent call last):
File "/opt/homebrew/bin/hawk_scanner", line 8, in
sys.exit(main())
^^^^^^
File "/opt/homebrew/lib/python3.11/site-packages/hawk_scanner/main.py", line 116, in main
records_mini = ', '.join(result['matches']) if len(result['matches']) < 25 else ', '.join(result['matches'][:25]) + f" + {len(result['matches']) - 25} more records"
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: sequence item 0: expected str instance, tuple found

@rohitcodergroww
Copy link

Hi @censey thanks for using Hawk-eye

Can you run this with --debug flag and post output or screenshot here

@censey
Copy link
Author

censey commented Jan 26, 2024

Its taking a really long time to run but is parsing the various patterns. I will try to wait it out. Been hours for a single 1mb file.

@rohitcoder
Copy link
Owner

@censey This shouldn't be happening, i just checked this with bunch of pdfs, zip and docx. It's working fine with latest version. Can you tell me what kind of file you are scanning and possibly screenshot or logs with --debug flag?

@censey
Copy link
Author

censey commented Jan 27, 2024

Ok. I removed the xlsx file I was testing with and added a very simple word doc. That ran flawlessly. Something in this xls was causing all kinds of issues. It was a large document with several tabs and 2k plus columns. Some nested json in some of the fields too.

@rohitcoder
Copy link
Owner

Hey @censey

Let me know, if you still find this issue relevant, then I'll look into this. Otherwise, I will be closing this. If possible please attach a sample file for debugging purpose, after removing sensitive data from it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants