Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 3067921Time Stamp: Jan 17, 2024
Fine-tune Llama 2 using QLoRA and Deploy it on Amazon SageMaker with AWS Inferentia2 | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 3012051Time Stamp: Dec 13, 2023
Welcome to a New Era of Building in the Cloud with Generative AI on AWS | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2987067Time Stamp: Nov 30, 2023
Scale foundation model inference to hundreds of models with Amazon SageMaker – Part 1 | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2989158Time Stamp: Nov 30, 2023
Intuitivo achieves higher throughput while saving on AI/ML costs using AWS Inferentia and PyTorch | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2957510Time Stamp: Oct 26, 2023
Retrieval-Augmented Generation & RAG Workflows Source Cluster: AI & Machine Learning Source Node: 2955016Time Stamp: Oct 24, 2023
Optimize generative AI workloads for environmental sustainability | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2893104Time Stamp: Sep 21, 2023
Machine learning with decentralized training data using federated learning on Amazon SageMaker | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2839161Time Stamp: Aug 22, 2023
Optimize AWS Inferentia utilization with FastAPI and PyTorch models on Amazon EC2 Inf1 & Inf2 instances | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2793123Time Stamp: Jul 24, 2023
Reduce energy consumption of your machine learning workloads by up to 90% with AWS purpose-built accelerators | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2735872Time Stamp: Jun 20, 2023
AWS Inferentia2 builds on AWS Inferentia1 by delivering 4x higher throughput and 10x lower latency | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2726040Time Stamp: Jun 13, 2023
Scale your machine learning workloads on Amazon ECS powered by AWS Trainium instances | Amazon Web Services Source Cluster: AWS Machine Learning Source Node: 2690131Time Stamp: May 31, 2023
Achieve high performance with lowest cost for generative AI inference using AWS Inferentia2 and AWS Trainium on Amazon SageMaker Source Cluster: AWS Machine Learning Source Node: 2629692Time Stamp: May 4, 2023
How to extend the functionality of AWS Trainium with custom operators Source Cluster: AWS Machine Learning Source Node: 2614083Time Stamp: Apr 27, 2023
Deploy large models at high performance using FasterTransformer on Amazon SageMaker Source Cluster: AWS Machine Learning Source Node: 2590656Time Stamp: Apr 17, 2023
Announcing New Tools for Building with Generative AI on AWS Source Cluster: AWS Machine Learning Source Node: 2580528Time Stamp: Apr 13, 2023
Deploy large language models on AWS Inferentia2 using large model inference containers Source Cluster: AWS Machine Learning Source Node: 2574649Time Stamp: Apr 10, 2023
Maximize performance and reduce your deep learning training cost with AWS Trainium and Amazon SageMaker Source Cluster: AWS Machine Learning Source Node: 2014885Time Stamp: Mar 14, 2023
AWS and Hugging Face collaborate to make generative AI more accessible and cost efficient Source Cluster: AWS Machine Learning Source Node: 1971345Time Stamp: Feb 21, 2023
Scaling distributed training with AWS Trainium and Amazon EKS Source Cluster: AWS Machine Learning Source Node: 1934922Time Stamp: Feb 1, 2023