mLC mLC -4 years ago 187
Python Question

Python: Get zip file from Google Drive API and load its content

I have a

on my
Google Drive
. In that
is a
XML file
, which I want to parse, extract a specific information and save this information on my local computer (or wherever).

My goal is to use Python & Google Drive API (with help of PyDrive) to achieve this. The workflow could be as follows:

  1. Connect to my Google Drive via Google Drive API (PyDrive)

  2. Get my zipfile id

  3. Load my zipfile to memory

  4. Unzip, obtain the XML file

  5. Parse the XML, extract the desired information

  6. Save it as a csv on my local computer

Right now, I am able to do steps 1,2,4,5,6. But I dont know how to load the zipfile into memory without writing it on my local HDD first.

Following PyDrive code will obtain the zipfile and place it on my local HDD, which is not exactly what I want.

toUnzip = drive.CreateFile({'id':'MY_FILE_ID'})

I guess one solution could be as follows:

I could read the zipfile as a string with some encoding:

toUnzip = drive.CreateFile({'id':'MY_FILE_ID'})
zipAsString = toUnzip.GetContentString(encoding='??')

and then, I could somehow (no idea how, perhaps
could be useful) read this string with Python zipfile library. Is this solution even possible? Is there a better way?

Answer Source

Eventually, I solved it using BytesIOand cp862 encoding:

toUnzipStringContent = toUnzip.GetContentString(encoding='cp862')
toUnzipBytesContent = BytesIO(toUnzipStringContent.encode('cp862'))
readZipfile = zipfile.ZipFile(toUnzipBytesContent, "r")
Recommended from our users: Dynamic Network Monitoring from WhatsUp Gold from IPSwitch. Free Download