What Is The Proper Way To Write To The Google App Engine Blobstore As A File In Python 2.5
Solution 1:
Unfortunately python's garbage collector is not perfect. Every write you do creates lots of small objects (via protocol buffer creation) that is not collected by python on the fly for some reason. I found that in mapreduce library I have to do
import gc
gc.collect()
from time to time to keep garbage collector happy.
Solution 2:
Solution 3:
You should write all data at once to avoid problem and speed up - optimize it - it should stop problem but not solve bug -> http://docs.python.org/library/stringio.html. Consider that file.write could be not simple write but request to RPC API which is slow to setup - see SDK code - avoid multi calls/buffer.
With such small amount of data 4000*9 it should not occur - it looks like bug in Google API - just report it http://code.google.com/p/googleappengine/issues/list?can=2&q=&sort=-id&colspec=ID%20Type%20Component%20Status%20Stars%20Summary%20Language%20Priority%20Owner%20Log
Consider that 'create' is marked as experimental http://code.google.com/intl/pl/appengine/docs/python/blobstore/overview.html#Writing_Files_to_the_Blobstore
Fix also finalize bugs - not finalize invalid file or return result if exception!
import StringIO
from __future__ import with_statement
from google.appengine.api import files
from google.appengine.api import blobstore
deffiles_test(limit):
output = StringIO.StringIO()
# workaround!for x inrange(limit):
output.write("Testing \n")
file_name = files.blobstore.create(mime_type='application/octet-stream')
try:
with files.open(file_name, 'a') as f:
# workaround!
f.write(output.getvalue())
finally:
output.close()
files.finalize(file_name)
return files.blobstore.get_blob_key(file_name)
Solution 4:
Maybe I'm wrong, but I am quite sure that the number of calls to write() could be the problem here. I think that is the problem because I had a similar problem when I tried to save a file that was uploaded by a user.
For me this was causing the problem:
with files.open(file_name, 'a') as f:
for chunk in read_in_chunks(data):
f.write(chunk)
For me the problem went away when i changed the chunk size to be 1Mb instead of 1Kb.
Post a Comment for "What Is The Proper Way To Write To The Google App Engine Blobstore As A File In Python 2.5"