Samsung’s threatench -barkmark puts AI -Chatbots for trial to see if they are ready to replace real workers in everyday offices


  • Samsung threatenbech exposes to chatbots to strict rules without partial credit
  • Samsung uses 2,485 tests across languages ​​to mimic workloads
  • Inputs range from short prompt to documents over twenty thousand characters

The adoption of AI tools at work has grown rapidly and raises concerns not only about automation, but also about how these systems are judged.

Until now, most benchmarks have been narrow in scope, tested AI writers and AI -Chatbot systems with simple prompts that rarely look like office life.

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top