April 17, 2026Machine learning Introducing granular cost attribution for Amazon Bedrock As AI inference grows into a significant share of cloud spend, understanding […] Read more
April 17, 2026Machine learning Power video semantic search with Amazon Nova Multimodal Embeddings Video semantic search is unlocking new value across industries. The demand for […] Read more
April 17, 2026Machine learning Optimize video semantic search intent with Amazon Nova Model Distillation on Amazon Bedrock Optimizing models for video semantic search requires balancing accuracy, cost, and latency. […] Read more
April 17, 2026Machine learning Nova Forge SDK series part 2: Practical guide to fine-tune Nova models using data mixing capabilities This hands-on guide walks through every step of fine-tuning an Amazon Nova […] Read more
April 17, 2026Machine learning From hours to minutes: How Agentic AI gave marketers time back for what matters Your marketing team loses hours to page assembly, coordination emails, and review […] Read more
April 16, 2026Machine learning How Automated Reasoning checks in Amazon Bedrock transform generative AI compliance Compliance teams in regulated industries spend weeks on manual reviews, pay for […] Read more
April 16, 2026Machine learning Transform retail with AWS generative AI services Online retailers face a persistent challenge: shoppers struggle to determine the fit […] Read more
April 16, 2026Machine learning Cost-efficient custom text-to-SQL using Amazon Nova Micro and Amazon Bedrock on-demand inference Text-to-SQL generation remains a persistent challenge in enterprise AI applications, particularly when […] Read more
April 15, 2026Machine learning Rede Mater Dei de Saúde: Monitoring AI agents in the revenue cycle with Amazon Bedrock AgentCore This post is cowritten by Renata Salvador Grande, Gabriel Bueno and Paulo […] Read more
April 15, 2026Machine learning Accelerating decode-heavy LLM inference with speculative decoding on AWS Trainium and vLLM Practical benchmarks showing faster inter-token latency when deploying Qwen3 models with vLLM, […] Read more