Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some *.pdf files are HTML documents / 302 redirections or 403 forbidden #2

Open
WildCryptoFox opened this issue Mar 14, 2021 · 0 comments

Comments

@WildCryptoFox
Copy link

WildCryptoFox commented Mar 14, 2021

(Not an issue for me. Just a notification.)

While searching for redirection and forbidden pages in my own archive directory (which transitively includes your library), I noticed you've hit the same annoying issue.

$ find -type f -name '*.pdf' -exec file {} + | grep -v PDF
./machine learning/Learning to Protect Communications Using Adverarial Neural Cryptography (2016) - Abadi, Andersen.pdf:                                                                                                                                   HTML document, ASCII text
./music theory & sheet music/Op. 27 No. 2: Sonata quasi una Fantasia - Beethoven.pdf:                                                                                                                                                                      HTML document, UTF-8 Unicode text
./maps/cards/amharic.pdf:                                                                                                                                                                                                                                  HTML document, ASCII text
./maps/cards/georgia.pdf:                                                                                                                                                                                                                                  HTML document, ASCII text
./maps/cards/nepali.pdf:                                                                                                                                                                                                                                   HTML document, ASCII text
./maps/cards/ukranian.pdf:                                                                                                                                                                                                                                 HTML document, ASCII text
./physics/HEP theory/Can We Mine a Black Hole? (2015) [SciAm] - Brown.pdf:                                                                                                                                                                                 HTML document, UTF-8 Unicode text, with very long lines
./cryptography & mathematics/quantum algorithms & cryptanalysis/Breaking Symmetric Cryptosystems using Quantum Period Finding (2016) - Kaplan, Leurent, Leverrier, Naya-Placencia.pdf:                                                                     HTML document, ASCII text
./cryptography & mathematics/hashes/keccak/First Analysis of Keccak (2009) - Aumasson, Khovratovich.pdf:                                                                                                                                                   HTML document, ASCII text
./cryptography & mathematics/stream ciphers/All Your Biases Belong To Us: Breaking RC4 in WPA-TKIP and TLS (2015) - Vanhoef, Piessens.pdf:                                                                                                                 HTML document, ASCII text, with very long lines
./cryptography & mathematics/isogeny-based cryptography/Constructing elliptic curve isogenies in quantum subexponential time (2011) - Childs, Jao, Soukharev.pdf:                                                                                          HTML document, ASCII text
./cryptography & mathematics/zero knowledge/Proof Systems for General Statements about Discrete Logarithms (1997) - Camenisch, Stadler.pdf:                                                                                                                HTML document, ASCII text, with CRLF, LF line terminators
./anonymity & circumvention/bitcoin/Mixcoin: Anonymity for Bitcoin with accountable mixes (2014) - Bonneau, Narayanan, Miller, Clark, Kroll, Felten.pdf:                                                                                                   HTML document, ASCII text
./anonymity & circumvention/dpi/Empirical Study of a National-Scale Distributed IDS: Backbone-Level Filtering of HTML Responses in China (2010) - Park, Crandall.pdf:                                                                                      PDF document, version 1.4

The following are all arxiv.org 302 redirections, thus easy fixes. The maps/cards/*.pdf appear to be dead links that not even archive.org caught. I didn't investigate the rest further.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant