Read file content from S3 bucket with boto3

python amazon-web-services amazon-s3 boto3

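boto3 offers a resource model that makes tasks like iterating through the objects in a bucket easier. Unfortunately, at the time of this answer StreamingBody (the type of the object's body) didn't provide readline or readlines, so the example reads the whole body at once. Newer botocore releases add iter_lines() for line-by-line reads.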

    import boto3

    s3 = boto3.resource('s3')
    bucket = s3.Bucket('test-bucket')

    # Iterates through all the objects, doing the pagination for you. Each obj
    # is an ObjectSummary, so it doesn't contain the body. You'll need to call
    # get to get the whole body.
    for obj in bucket.objects.all():
        key = obj.key
        body = obj.get()['Body'].read()
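If you want lines rather than the whole body, and your botocore version has no line iterator, one option is to read in chunks and split on newlines yourself. This is only a minimal sketch: the helper name `iter_body_lines` is hypothetical (it is not part of boto3), it assumes nothing about the body beyond a `read(n)` method, and `io.BytesIO` stands in for the StreamingBody so the snippet runs without AWS access.

```python
import io


def iter_body_lines(body, chunk_size=1024):
    """Yield lines (as bytes, without the trailing b"\\n") from any object with read(n).

    Hypothetical helper, not a boto3 API. Only b"\\n" is treated as a line
    separator; a b"\\r" before it is left in place.
    """
    pending = b""
    while True:
        chunk = body.read(chunk_size)
        if not chunk:
            break
        pending += chunk
        lines = pending.split(b"\n")
        # The last piece may be an incomplete line; keep it for the next chunk.
        pending = lines.pop()
        for line in lines:
            yield line
    if pending:
        # Final line with no trailing newline.
        yield pending


# Stand-in for obj.get()['Body']; io.BytesIO also exposes read(n).
fake_body = io.BytesIO(b"first\nsecond\nthird")
lines = [line.decode("utf-8") for line in iter_body_lines(fake_body, chunk_size=4)]
```

With a real object you would pass `obj.get()['Body']` instead of `fake_body`. Memory use then stays at about one chunk plus the longest line, rather than the size of the whole object.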

You might also consider the smart_open module, which supports iterators:

    from smart_open import smart_open

    # stream lines from an S3 object
    for line in smart_open('s3://mybucket/mykey.txt', 'rb'):
        print(line.decode('utf8'))

and context managers:

    with smart_open('s3://mybucket/mykey.txt', 'rb') as s3_source:
        for line in s3_source:
            print(line.decode('utf8'))

        s3_source.seek(0)  # seek to the beginning
        b1000 = s3_source.read(1000)  # read 1000 bytes

Find smart_open at https://pypi.org/project/smart_open/

Using the client instead of resource:

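    import boto3

    s3 = boto3.client('s3')
    bucket = 'bucket_name'

    result = s3.list_objects(Bucket=bucket, Prefix='/something/')
    # 'Contents' is missing from the response when nothing matches the prefix,
    # so fall back to an empty list instead of iterating over None.
    for o in result.get('Contents', []):
        data = s3.get_object(Bucket=bucket, Key=o.get('Key'))
        contents = data['Body'].read()
        print(contents.decode("utf-8"))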
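Note that a single list call on the client returns at most 1000 keys, so unlike the resource's `objects.all()` it does not paginate for you. A rough sketch of the same loop using a paginator follows. The helper name `iter_object_keys` is hypothetical, and it takes the client as an argument so it can be tried against a stub without credentials.

```python
def iter_object_keys(client, bucket, prefix=""):
    """Yield every key under prefix, following pagination.

    Hypothetical helper; client is expected to be a boto3 S3 client
    (or anything exposing a compatible get_paginator method).
    """
    paginator = client.get_paginator("list_objects_v2")
    for page in paginator.paginate(Bucket=bucket, Prefix=prefix):
        # 'Contents' is absent when a page has no matching objects.
        for entry in page.get("Contents", []):
            yield entry["Key"]


# Usage with a real client (assumes configured AWS credentials;
# the bucket name and prefix are placeholders):
#   import boto3
#   s3 = boto3.client("s3")
#   for key in iter_object_keys(s3, "bucket_name", "something/"):
#       body = s3.get_object(Bucket="bucket_name", Key=key)["Body"].read()
#       print(body.decode("utf-8"))
```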
