假设您的 csv 文件不太大,您可以通过将两者读入来完成此操作pandas
并使用join
方法。举个例子:
import pandas as pd
file1 = pd.DataFrame({'Title': ['file1.csv', 'file2.csv', 'file3.csv']})
file2 = pd.DataFrame({'Title': ['file1.csv', 'file2.csv', 'file4.csv'],
'Description': ['List of files', 'List of descriptions', 'Something unrelated']})
joined = pd.merge(file1, file2, left_on='Title', right_on='Title')
print joined
这打印:
Title Description
0 file1.csv List of files
1 file2.csv List of descriptions
即仅存在于两者中的文件。
As pandas
可以本机将 csv 读入数据帧,在您的情况下您可以这样做:
import pandas as pd
file1 = pd.DataFrame.from_csv('file1.csv')
file2 = pd.DataFrame.from_csv('file2.csv')
joined = pd.merge(file1, file2, left_on='Title', right_on='Title')
joined.to_csv('Output.csv', index=False)