Bug: Fails to parse list of URLs txt file #968
Comments
Are you sure your URLs have schemes at the front? They have to be fully qualified URLs (e.g. …). Can you post a redacted snippet of your actual urls.txt file? You can also try to force a specific parser with …
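For anyone who wants to sanity-check a URL list before handing it to ArchiveBox, here is a minimal sketch. It is not ArchiveBox's own parser; `find_unqualified_urls` is a hypothetical helper, and treating "has both a scheme and a host" as "fully qualified" is an assumption made for illustration.

```python
from urllib.parse import urlparse


def find_unqualified_urls(text: str) -> list[str]:
    """Return the non-empty lines that lack a scheme or a host."""
    bad = []
    for line in text.splitlines():
        candidate = line.strip()
        if not candidate:
            continue  # blank lines are ignored
        parsed = urlparse(candidate)
        # A fully qualified URL (by this sketch's definition) has both parts.
        if not (parsed.scheme and parsed.netloc):
            bad.append(candidate)
    return bad


# Made-up sample data standing in for the contents of a urls.txt file.
sample = "https://example.com/page\nexample.org/no-scheme\n\nhttp://example.net\n"
print(find_unqualified_urls(sample))
```

An empty result means every line at least looks like a fully qualified URL, so a parse failure is likely caused by something else.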
Yep, can confirm that file has fully qualified URLs.
Sure.
I've tried setting the parser explicitly as you suggested. None of them picked up the URLs, and the errors varied only slightly.
Contents of … I can confirm that using input redirection does work fine, so this works: …
Try with …
Doesn't seem to change the error
I've also tried this on a fresh Docker image based installation, and it fails similarly:
/tmp/ff/urls.txt being the same simple file:
Ah sorry, I forgot I removed loading directly from a file path in a previous version because it conflicted with the new … I'll reopen and merge your original PR #967. For future reference, stdin redirection is indeed necessary, or passing …
Describe the bug
I can't seem to get archivebox to add any URLs from a simple txt file containing a newline-separated list of URLs.
Based on the error message, it fails to parse the file. I may be doing something wrong.
Steps to reproduce
archivebox add /tmp/urls.txt
Screenshots or log output
Here's the output I get:
ArchiveBox version