Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add seed: RCN list file #3

Open
juzraai opened this issue Apr 28, 2018 · 0 comments
Open

Add seed: RCN list file #3

juzraai opened this issue Apr 28, 2018 · 0 comments
Labels
feature New functionality or improvement good first issue Easier tasks, scut work, etc.

Comments

@juzraai
Copy link
Owner

juzraai commented Apr 28, 2018

What problem this feature would solve (e.g. who needs it and why)

One may collect RCNs by hand into a spreadsheet, it may be difficult to generate a -s rcn1,rcn2,... list.

How would it solve it

It would be easier to export the RCN column or the whole spreadsheet to a text file, then tell the crawler to read RCNs from there.

How do you imagine using the feature (e.g. CLI configuration, output format)

-s file.tsv,1 seed argument would tell the crawler to read file.tsv as a TSV file and search for RCNs in the first column. Let's use 1-based indexes here.

The column index can be optional with default value 1.

We can accept both *.tsv and *.txt file extensions.

The reader would iterate the file line-by-line and would skip non-number values.

@juzraai juzraai added feature New functionality or improvement good first issue Easier tasks, scut work, etc. labels Apr 28, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature New functionality or improvement good first issue Easier tasks, scut work, etc.
Projects
None yet
Development

No branches or pull requests

1 participant