Preprocess and fine-tune LLMs quickly and cost-effectively using Amazon EMR Serverless and Amazon SageMaker | Amazon Web Services Source Cluster: AWS Big Data Source Node: 3093108Time Stamp: Feb 1, 2024
AWS Lake Formation 2023 year in review | Amazon Web Services Source Cluster: AWS Big Data Source Node: 3070629Time Stamp: Jan 18, 2024
Enforce fine-grained access control on Open Table Formats via Amazon EMR integrated with AWS Lake Formation | Amazon Web Services Source Cluster: AWS Big Data Source Node: 3068089Time Stamp: Jan 17, 2024
Orchestrate Amazon EMR Serverless Spark jobs with Amazon MWAA, and data validation using Amazon Athena | Amazon Web Services Source Cluster: AWS Big Data Source Node: 3009324Time Stamp: Dec 12, 2023
Use Amazon EMR with S3 Access Grants to scale Spark access to Amazon S3 | Amazon Web Services Source Cluster: AWS Big Data Source Node: 2979941Time Stamp: Nov 26, 2023
Use generative AI with Amazon EMR, Amazon Bedrock, and English SDK for Apache Spark to unlock insights | Amazon Web Services Source Cluster: AWS Big Data Source Node: 2980843Time Stamp: Nov 16, 2023
Introducing Amazon MWAA support for Apache Airflow version 2.7.2 and deferrable operators | Amazon Web Services Source Cluster: AWS Big Data Source Node: 2968217Time Stamp: Nov 6, 2023
Use IAM runtime roles with Amazon EMR Studio Workspaces and AWS Lake Formation for cross-account fine-grained access control | Amazon Web Services Source Cluster: AWS Big Data Source Node: 2968219Time Stamp: Nov 6, 2023
Spark on AWS Lambda: An Apache Spark runtime for AWS Lambda | Amazon Web Services Source Cluster: AWS Big Data Source Node: 2963284Time Stamp: Oct 30, 2023
How Meesho built a generalized feed ranker using Amazon SageMaker inference | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2947465Time Stamp: Oct 20, 2023
Run Apache Hive workloads using Spark SQL with Amazon EMR on EKS | Amazon Web Services Source Cluster: AWS Big Data Source Node: 2943231Time Stamp: Oct 18, 2023
Orchestrate Amazon EMR Serverless jobs with AWS Step functions | Amazon Web Services Source Cluster: AWS Big Data Source Node: 2932457Time Stamp: Oct 12, 2023
Automated data governance with AWS Glue Data Quality, sensitive data detection, and AWS Lake Formation | Amazon Web Services Source Cluster: AWS Big Data Source Node: 2934245Time Stamp: Oct 10, 2023
Define per-team resource limits for big data workloads using Amazon EMR Serverless | Amazon Web Services Source Cluster: AWS Big Data Source Node: 2919310Time Stamp: Oct 5, 2023
Query big data with resilience using Trino in Amazon EMR with Amazon EC2 Spot Instances for less cost | Amazon Web Services Source Cluster: AWS Big Data Source Node: 2923593Time Stamp: Oct 4, 2023
Apache Iceberg optimization: Solving the small files problem in Amazon EMR | Amazon Web Services Source Cluster: AWS Big Data Source Node: 2916743Time Stamp: Oct 3, 2023
Non-JSON ingestion using Amazon Kinesis Data Streams, Amazon MSK, and Amazon Redshift Streaming Ingestion | Amazon Web Services Source Cluster: AWS Big Data Source Node: 2913873Time Stamp: Oct 2, 2023
Introducing hybrid access mode for AWS Glue Data Catalog to secure access using AWS Lake Formation and IAM and Amazon S3 policies | Amazon Web Services Source Cluster: AWS Big Data Source Node: 2903192Time Stamp: Sep 26, 2023
Capacity Management and Amazon EMR Managed Scaling improvements for Amazon EMR on EC2 clusters | Amazon Web Services Source Cluster: AWS Big Data Source Node: 2869033Time Stamp: Sep 7, 2023
Query your Iceberg tables in data lake using Amazon Redshift (Preview) | Amazon Web Services Source Cluster: AWS Big Data Source Node: 2857736Time Stamp: Aug 31, 2023