Terrill Dicki
  Feb 04, 2025 10:23
Explore how Scale AI’s Felix Su emphasizes the importance of architecture and guardrails in building successful enterprise-grade conversational AI systems.
 
According to ElevenLabs, creating a successful enterprise-grade conversational AI system requires more than just cutting-edge technology. Felix Su, Head of Engineering for Scale’s GenAI Platform, stresses the necessity of integrating the right architecture and controls to ensure AI systems adhere to business logic, brand guidelines, and safety principles.
The Power of Separation: Why Architecture Matters
In the realm of Conversational Voice AI, developers face a choice between using multi-modal, voice-to-voice models or assembling components like ASR (Automatic Speech Recognition), LLMs (Large Language Models), and TTS (Text-to-Speech). Su notes that for enterprises with specific needs, the latter approach often proves more effective.
“In an enterprise setting, AI often can’t be done with just LLMs,” Su explains. He emphasizes the importance of designing complex systems that maintain brand image and operate within strict guidelines. Such systems often require the implementation of custom guardrails on top of LLMs to better control the inputs and outputs.
TIME’s Person of the Year Experience
TIME AI’s recent launch allows readers to engage in interactive conversations about TIME’s journalism, including their iconic Person of the Year content. This implementation stands out due to its use of voice technology, which Su believes adds a unique, engaging element to the user experience.
By leveraging Scale AI’s AI development expertise and ElevenLabs’ Conversational AI platform, the system delivers a human-like interaction while rigorously maintaining control over content and brand voice.
Building Enterprise-Ready Systems
For enterprises aiming to deploy conversational AI at scale, collaboration with Scale AI and ElevenLabs presents a viable solution. Scale AI specializes in creating controlled, safe multi-modal AI systems, while ElevenLabs offers superior voice technology for seamless integration.
Though TIME’s implementation focuses on journalism, the underlying architecture is applicable across various sectors, particularly in enhancing customer service and support. Customers increasingly demand more engaging, human-like interactions, moving beyond basic chatbot functionalities.
Image source: Shutterstock
 
Source: https://blockchain.news/news/scale-ai-felix-su-enterprise-conversational-ai