I'm trying to write a script that checks whether a number of URLs exist:
import httplib
with open('urls.txt') as urls:
    for url in urls:
        connection = httplib.HTTPConnection(url)
        connection.request("GET")
        response = connection.getresponse()
        if response.status == 200:
            print '[{}]: '.format(url), "Up!"
But I get this error:
Traceback (most recent call last):
  File "test.py", line 5, in <module>
    connection = httplib.HTTPConnection(url)
  File "/usr/lib/python2.7/httplib.py", line 693, in __init__
    self._set_hostport(host, port)
  File "/usr/lib/python2.7/httplib.py", line 721, in _set_hostport
    raise InvalidURL("nonnumeric port: '%s'" % host[i+1:])
httplib.InvalidURL: nonnumeric port: '//globo.com/galeria/amazonas/a.html
What's wrong?
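This may have a simple fix. Here:
connection = httplib.HTTPConnection(url)
you are using HTTPConnection, which expects a host name (optionally followed by :port), not a full URL. So instead of passing http://osmquote.com, you need to pass osmquote.com.
In short, remove http:// and https:// from your URLs, because httplib treats the text after the ":" as the port number, and the port number must be numeric. In your case that text is //globo.com/galeria/amazonas/a.html, hence the "nonnumeric port" error.
Note that the path (/galeria/amazonas/a.html) does not belong in the host either; it goes in the request call, as in connection.request("GET", path). Each line read from the file also ends with a newline, which should be stripped first.
Hope this helps!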
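As a rough sketch of the advice above, the URL can be split into host and path before connecting, rather than stripping the scheme by hand. This version is written for Python 3, where httplib was renamed http.client and urlparse became urllib.parse; the helper names split_url and is_up, the 10-second timeout, and the choice to treat scheme-less lines as plain HTTP are assumptions made for illustration, not part of the original question.

```python
# Python 3 sketch; in Python 2 the equivalent modules are httplib and urlparse.
import http.client
from urllib.parse import urlsplit


def split_url(line):
    """Split one line from urls.txt into (scheme, host, port, path)."""
    url = line.strip()  # drop the trailing newline left by file iteration
    if "://" not in url:
        url = "http://" + url  # assumption: bare hosts mean plain HTTP
    parts = urlsplit(url)
    path = parts.path or "/"
    if parts.query:
        path += "?" + parts.query
    return parts.scheme, parts.hostname, parts.port, path


def is_up(line, timeout=10):
    """Return True if a GET for the URL answers with status 200."""
    scheme, host, port, path = split_url(line)
    if scheme == "https":
        connection = http.client.HTTPSConnection(host, port, timeout=timeout)
    else:
        connection = http.client.HTTPConnection(host, port, timeout=timeout)
    try:
        # The host went to the constructor; only the path goes here.
        connection.request("GET", path)
        return connection.getresponse().status == 200
    except (OSError, http.client.HTTPException):
        # DNS failures, refused connections, timeouts, malformed replies.
        return False
    finally:
        connection.close()
```

With these helpers, the loop from the question would call is_up(url) for each line of urls.txt and print the "Up!" message when it returns True. Redirects (301/302) are reported as not up here, since the check only accepts status 200, as in the original script.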