❌

Normal view

There are new articles available, click to refresh the page.
Before yesterdayMain stream

AWS brings prompt routing and caching to its Bedrock LLM service

4 December 2024 at 09:25

As businesses move from trying out generative AI in limited prototypes to putting them into production, they are becoming increasingly price conscious. Using large language models (LLMs) isn’t cheap, after all. One way to reduce cost is to go back to an old concept: caching. Another is to route simpler queries to smaller, more cost-efficient […]

Β© 2024 TechCrunch. All rights reserved. For personal use only.

AWS’ new service tackles AI hallucinations

3 December 2024 at 09:47

Amazon Web Services (AWS), Amazon’s cloud computing division, is launching a new tool to combat hallucinations β€” that is, scenarios where an AI model behaves unreliably. Announced at AWS’ re:Invent 2024 conference in Las Vegas, the service, Automated Reasoning checks, validates a model’s responses by cross-referencing customer-supplied info for accuracy. (Yes, the word β€œchecks” is […]

Β© 2024 TechCrunch. All rights reserved. For personal use only.

❌
❌