我怎样才能替换名字pdf
使用 Python Requests 下载的文件?
我想将其另存为Manual_name1.pdf
not as Elkinson%20Jeffrey.pdf
CSV 文件如下所示:
Manual_name1 https://www.adndrc.org/diymodule/doc_panellist/Elkinson%20Jeffrey.pdf
Manual_name2 http://www.parliament.bm/uploadedFiles/Content/House_Business/Presentation_of_Papers_and_of_Reports/PCA%20Report%209262014.pdf
manual_name3 http://www.ohchr.org/Documents/HRBodies/OPCAT/elections2016/HaimoudRamdan.pdf
我当前的代码:
import os
import csv
import requests
write_path = 'C:\\Users\\hgdht\\Desktop\\Downloader_Automation' # ASSUMING THAT FOLDER EXISTS!
with open('Links.csv', 'r') as csvfile:
spamreader = csv.reader(csvfile)
for link in spamreader:
if not link:
continue
print('-'*72)
pdf_file = link[0].split('/')[-1]
with open(os.path.join(write_path, pdf_file), 'wb') as pdf:
try:
# Try to request PDF from URL
print('TRYING {}...'.format(link[0]))
a = requests.get(link[0], stream=True)
for block in a.iter_content(512):
if not block:
break
pdf.write(block)
print('OK.')
except requests.exceptions.RequestException as e: # This
will catch ONLY Requests exceptions
print('REQUESTS ERROR:')
print(e) # This should tell you more details about the error
代替
pdf_file = link[0].split('/')[-1]
使用 csv 文件中的特定列:
pdf_file = link[1] # (assuming the file name is in the second column)
如果文件名位于第一列,则应该使用
pdf_file = link[0] # (assuming the file name is in the first column)
# OR
import time # put this in the beginning of your script
pdf_file = '{}-{}.pdf'.format(link[0], int(time.time()))
# file name will look like: "name-1495460691.pdf"
但是当使用请求调用链接时,您将必须更改对链接本身的引用:
a = requests.get(link[1], stream=True) # (assuming the link is in the second column)
本文内容由网友自发贡献,版权归原作者所有,本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容,请联系:hwhale#tublm.com(使用前将#替换为@)