Integrating Speech Recognition Engine and New Low-Power DSP; Supports Wake-on-Voice, Enables Watch Live and Talk™ and Hands-Free Voice Control for Smart TVs
LAS VEGAS — (BUSINESS WIRE) — January 8, 2013 — Conexant Systems Inc. today announced the CX20865, its new high-performance Far-Field Voice input processor system-on-chip (SoC) designed specifically for Smart TVs. The CX20865 is the first dedicated Far-Field Voice solution with integrated Automatic Speech Recognition (ASR) engine, optimized for telepresence and voice control applications. Conexant has integrated a third party ASR with the new low-power DSP to support stand-alone Wake-on-Voice functionality during Smart TV power-off state. Combined with Conexant’s proprietary Far-Field Voice pre-processing algorithms and integrated high-performance 24bit ADCs, the CX20865 adds a differentiated Watch Live and Talk™ experience with popular VoIP applications, such as Skype TV, and delivers accurate voice control performance while the TV is blasting in high volume. The new Selective Source Pickup™ technology maintains speech recognition in the presence of random ambient interference, enabling live demos in the retail environment and tradeshows.
As the centerpiece of the living room, the Smart TV is becoming the new hub for the human interface to other digital appliances and the Cloud. Voice is the natural and ideal hands-free interface for consumers to access and interact with Smart TVs; from basic voice remote controls to advanced voice search, apps interaction, social networking, chat and gaming. However, in order for voice to deliver on the promise of Smart TV experience for users, ambient noise, TV playback volume and distance must be taken into consideration. A pioneer in Far-Field Voice processing technology, Conexant is working closely with leading TV OEMs to overcome these technical challenges.
The new CX20865 hardware features dual cores, high-performance 32bit DSP, processing multi-thread voice streams and a third party ASR. The integrated DC/DC provides advanced power management functions. The device has full-speed USB and I2S I/Os for voice data, and I2C and UART interface for control. The quad microphone input paths have been upgraded with high-performance pre-amp and 24 bit ADCs, delivering 106dB dynamic range full path. Further, each microphone path has dedicated microphone bias to prevent cross-talk.
The voice pre-processing performance is enhanced to deliver >97 percent ASR hit-rate when the TV is playing sound at high volume. The new echo-cancellation supports stereo, 2.1CH, and 2+2CH internal speaker configurations. The improved de-reverberation and true wideband processing has radically enhanced the perceptual voice clarity. To this end, Conexant is also working with leading regional ASR vendors to optimize hit-rates and reduce false detection with their local and Cloud-based ASR engines.
The featured Watch Live and Talk seamlessly blends voice conversation and TV entertainment into a single experience, allowing two parties to “share the moment” watching the same or different TV programs while carrying on a conversation. The experience also applies to voice-chat during multi-party game play on TV, without tethering to a headset.
Selective Source Pickup (SSP) is a breakthrough for Far-Field noise suppression in time-varying noisy environments. SSP distinguishes the targeted audience from random speech/noise interference by means of multi-source separation. The technology is optimized for ASR applications which go beyond traditional emulation of human auditory systems that are limited to human hearing intelligence. “A scenario such as an in-store demo at a large, crowded, noisy retailer shop has been plagued by low hit-rate and false detection,” noted Jonathan Chien, Conexant executive marketing director. “We have taken a unique approach to address this chaotic situation with Selective Source Pickup. The result is a significantly improved voice interaction experience in the retail shops, as the TV OEM intended.”
“Conexant is at the forefront of Far-Field Voice processing technology,” noted Saleel Awsare, vice president and general manager for Conexant. “We provide the only turnkey solution, the necessary expertise and support that enables our TV OEM partners to provide the best-in-class experience that allows them to keep up with the advancements in the human voice interface for Smart TVs.”
The CX20865 supports the following three major voice user modes:
- Conversation – for clear and natural voice clarity in a large room; proven Skype TV certification experience.
- Command and Control – for Local-based ASR hit rate optimization; wake-on voice; noise and echo rejection
- Voice Dictation – for Cloud-based natural language ASR hit-rate optimization; room/scene adaptation
Other key features include:
- Watch Live and Talk
- Selective Source Pickup
- Multi-vendor hit-rate optimization
- Pre-tuned “hidden” microphone module for industrial design esthetics and performance
- Far-Field Voice processing with AEC supporting Stereo, 2.1CH and 2+2 (tweeters and woofers) configurations
- 32bit dual-core DSP with integrated data and program memory
- Integrated DC/DC for power management
- Quad 24 bit ADC and microphone preamp with independent microphone bias supply