在 AWS SageMaker 中将从 S3 读取的字节数组格式转换为 numpy 数组或张量-解网

问：

我已经阅读了一些X_train和y_train，并将它们以内存字节数组的形式上传到 s3，如下所示：

X_train并且是一维数组，例如：y_train

X_train：

array([[ 2. ],[12.9],[ 1.3],[ 5.1],[ 9.6],[ 8.2],...

y_train：

array([[ 43525.],[135675.],[ 46205.],[ 66029.],[112635.],...

    import io
    import sagemaker                               
    import sagemaker.amazon.common as smcl

    sm_session = sagemaker.Session()
    bucket = sm_session.default_bucket()

    buffer = io.BytesIO()

    # writing train data to the form of tensors:
    smcl.write_numpy_to_dense_tensor(buffer, X_train, y_train.reshape(-1))
    buffer.seek(0)


    # Uploading to s3
    file_name = 'Train_data'
    folder_name = 'Test_folder'
    path_to_train_data = os.path.join(folder_name,'train',file_name)
    boto3.resource('s3').Bucket(bucket).Object(path_to_train_data).upload_fileobj(buffer)

我想从 s3 中读回它们并将它们调整为原始形式：

    s3 = boto3.resource('s3')
    bucket = s3.Bucket(bucket)

    
    buf = io.BytesIO()
    bucket.download_fileobj(key_from_s3, buf)
    filecontent_bytes = buf.getvalue()

的输出如下所示：fileconent_byte

b'\n#\xd7\xce(\x00\x00\x00\n\x12\n\x06values\x12\x08\x12\x06\n\x04\x00\x00\x00@\x12\x12\n\...

如何将它们转换为原始形式？谢谢。

Python 数组 amazon-web-services io amazon-sagemaker

在 AWS SageMaker 中将从 S3 读取的字节数组格式转换为 numpy 数组或张量

Convert bytes array format read from S3 to numpy array or tensor in AWS SageMaker

评论

评论