-
-
Notifications
You must be signed in to change notification settings - Fork 32.5k
Closed
Labels
stdlibPython modules in the Lib dirPython modules in the Lib dirtype-bugAn unexpected behavior, bug, or errorAn unexpected behavior, bug, or error
Description
Bug report
Currently the following is False
from urllib.request import Request, urlopen
len(urlopen(Request("http://google.com", method="HEAD")).read()) == 0 # False
But this is True
len(urlopen(Request("http://www.google.com", method="HEAD")).read()) == 0 # True
This is because http://google.com
redirects with 302 to http://www.google.com
.
This means that checking for existence of some file by URL will actually download the file when the URL responds with a redirect. This makes no sense. Also the HTTP spec says nothing about changing HEAD requests into GET requests; it just says that everything but GET and HEAD requests should require user interaction on redirect, which Python violates, but there's a comment on that explaining it's an active choice to violate the spec there.
To me it seems like this is an oversight. Note that curl -LI http://google.com
also sticks to HEAD requests.
Linked PRs
Metadata
Metadata
Assignees
Labels
stdlibPython modules in the Lib dirPython modules in the Lib dirtype-bugAn unexpected behavior, bug, or errorAn unexpected behavior, bug, or error