Accelerate the Value of Data

LCA as AWS Lambda: Deployment

Learn how to use AWS Lambda: Deployment to implement LCA.

To deploy an LCA in the AWS Lambda service, you need to create AWS IAM roles, create an AWS S3 bucket, and create an AWS Lambda function as explained in this section.

To deploy an LCA in the AWS Lambda service:
  1. Create AWS IAM Roles
    Create a new IAM Role for working with the Lambda Functions. This role must have access to S3 and CloudWatch logs, and be able to invoke any other Lambda Functions. The role requires the following permissions:
    • CloudWatchLogsFullAccess
    • AmazonS3FullAccess
    • LambdaFullAccess
    See also: LCA as AWS Lambda: Identity and Access Management
  2. Create an AWS S3 Bucket:
    1. Create a new bucket and folder in S3.
    2. Upload all the LCA JARs to the folder and provide the required access to the user account.
  3. Create an AWS Lambda Function:
    1. Open the AWS Lambda console from the AWS Management Console.
    2. Click the Create Functions button.

    3. Select Author from Scratch.

    4. In the Function name field, type the Lambda Name (for example, NameConcatenate).
    5. In the Runtime field, select Java 17.
    6. Use the default value for Architecture.
    7. In the Execution role area, select the existing IAM role that was created earlier and click Create Function.
    8. In the Lambda details page, click Upload From drop-down and select Amazon S3 location option.

    9. In the Upload a file from Amazon S3 dialog box, provide S3 path and click Save.

    10. In the Lambda details page, click Edit under the Runtime settings section.

    11. In the Handler field, type your main java class name. Ensure that you enter a fully qualified handler class (for example, com.reltio.cust.NameConcatenateClass) and click Save.

    12. To configure Environment Variables refer AWS document: AWS Lambda configuration environment variables.
    13. Click the Configuration tab, then choose General Configuration for Memory and Timeout settings.

  4. Optional: Enable multithreading to enhance performance

    Enable multithreading for CPU-intensive processing to potentially improve performance and cost efficiency. Note that results may vary based on your specific workload.

    To enable multithreading, add the following environment variable:

    • THREAD_POOL_SIZE - Number of threads to be used.
    Recommendation for number of threads based on configured Lambda memory size:
    Lambda memoryTHREAD_POOL_SIZE
    512MB (Default memory size)1
    2048MB2
    4096MB3
    5376MB4
    7168MB5
    8960MB6
    10240MB6