I want to write a script to rename downloaded papers with their titles automatically, I'm wondering if there is any library or tricks i can make use of? The PDFs are all generated by TeX and should have some 'formal' structures.
from pyPdf import PdfFileWriter, PdfFileReader
with open(pdf_file_path) as f:
pdf_reader = PdfFileReader(f)
title = get_pdf_title('/home/user/Desktop/my.pdf')
Email codedump link for Extracting titles from PDF files?