US20130223635A1 - Low power audio detection - Google Patents
Low power audio detection Download PDFInfo
- Publication number
- US20130223635A1 US20130223635A1 US13/776,882 US201313776882A US2013223635A1 US 20130223635 A1 US20130223635 A1 US 20130223635A1 US 201313776882 A US201313776882 A US 201313776882A US 2013223635 A1 US2013223635 A1 US 2013223635A1
- Authority
- US
- United States
- Prior art keywords
- audio
- predetermined
- audio signal
- power mode
- signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R29/00—Monitoring arrangements; Testing arrangements
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/10—Earpieces; Attachments therefor ; Earphones; Monophonic headphones
- H04R1/1041—Mechanical or electronic switches, or control elements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04W—WIRELESS COMMUNICATION NETWORKS
- H04W52/00—Power management, e.g. Transmission Power Control [TPC] or power classes
- H04W52/02—Power saving arrangements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/10—Earpieces; Attachments therefor ; Earphones; Monophonic headphones
- H04R1/1083—Reduction of ambient noise
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2420/00—Details of connection covered by H04R, not provided for in its groups
- H04R2420/07—Applications of wireless loudspeakers or wireless microphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2460/00—Details of hearing devices, i.e. of ear- or headphones covered by H04R1/10 or H04R5/033 but not provided for in any of their subgroups, or of hearing aids covered by H04R25/00 but not provided for in any of its subgroups
- H04R2460/03—Aspects of the reduction of energy consumption in hearing devices
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2499/00—Aspects covered by H04R or H04S not otherwise provided for in their subgroups
- H04R2499/10—General applications
- H04R2499/11—Transducers incorporated or for use in hand-held devices, e.g. mobile phones, PDA's, camera's
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2499/00—Aspects covered by H04R or H04S not otherwise provided for in their subgroups
- H04R2499/10—General applications
- H04R2499/13—Acoustic transducers and sound field adaptation in vehicles
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2499/00—Aspects covered by H04R or H04S not otherwise provided for in their subgroups
- H04R2499/10—General applications
- H04R2499/15—Transducers incorporated in visual displaying devices, e.g. televisions, computer displays, laptops
Definitions
- the present invention is directed generally to reducing power consumption in devices, and, more particularly, to devices and methods for detecting probable presence of a predetermined audio signal in audio signals while reducing power consumption in a device.
- Various devices have a limited energy supply, such as those that are powered by batteries. Some devices exist which may respond to voice commands or other occasional predetermined sounds (generally referred to herein as audio of interest). In general, devices may process an audio signal to detect any audio of interest. Most of the time, however, there is no audio of interest present in the audio signal. Furthermore, processing of the audio signal may cause the device to consume current, thereby increasing a power consumption in the device. The audio signal processing, thus, may limit a battery lifetime (notably a stand-by time) of the device.
- a device includes a processor coupled to a clock signal generator, a power controller and an audio detector.
- the power controller is configured to control a clock rate provided to the processor by the clock signal generator, to control the device to operate in a low power mode having a relatively low power consumption or in a normal power mode having a relatively high power consumption.
- the audio detector is coupled to the power controller.
- the audio detector is configured to receive audio signals and to detect, in the low power mode, probable presence of a predetermined audio signal in the audio signals.
- the power controller controls the device to switch from the low power mode to the normal power mode responsive to the detected presence of the predetermined audio signal by the audio detector.
- FIG. 1A is a functional block diagram of a device which detects a predetermined audio signal, according to an embodiment of the present invention
- FIG. 1B is a functional block diagram of a device which detects a predetermined audio signal, according to another embodiment of the present invention.
- FIG. 2 is a functional block diagram of an audio detector of the devices shown in FIGS. 1A and 1B , according to an embodiment of the present invention
- FIG. 3 is a functional block diagram of a comparator of the audio detector shown in FIG. 2 , according to an embodiment of the present invention.
- FIG. 4 is a flowchart diagram of a method of detecting a predetermined audio signal, according to an embodiment of the present invention.
- conventional devices may process an audio signal to detect audio of interest.
- Devices may, for example, use conventional voice recognition techniques to continually process the audio signal for audio of interest. These techniques, however, may result in relatively high power consumption.
- One alternative technique may be to periodically process a small burst of audio. For example, 10 ms of audio may be sampled every 100 ms to determine whether any audio of interest is present.
- buttons may interrupt a smooth user experience.
- some devices may use a simple electronic threshold detection (i.e., a noise gate) to indicate the start of audio of interest.
- a simple noise gate may provide too many false positive results in noisy environments and too many false negative results in quiet environments.
- Various devices may include a low power mode and a normal power mode.
- the energy consumption is typically reduced (compared to the normal power mode) by disabling some of the functions of the device.
- the low power mode may be useful, for example, for battery-powered devices.
- One audio detection technique may use a normal power mode processing capability of the system.
- voice recognition techniques typically involve a digital signal processor (DSP) capable of identifying keywords in an audio signal.
- DSP digital signal processor
- Continual use of the DSP may involve higher power consumption in the device.
- Periodic processing of small bursts of audio may also involve waking up significant parts of the system that aren't involved in audio processing, for example, one or more application processors, a general purpose random access memory (RAM) or wired communication hardware (such as a Universal Asynchronous Receiver-Transmitter (UART), a Universal Serial Bus (USB), a Secure Digital Input Output (SDIO), etc.). These components will consume power while the audio processing is taking place.
- UART Universal Asynchronous Receiver-Transmitter
- USB Universal Serial Bus
- SDIO Secure Digital Input Output
- a mobile device may intermittently or continuously detect audio activity, even during an idle mode (where the device is not actively running any application in response to a user's manual input).
- the device may automatically start and end logging of an audio signal based on detected audio activity.
- the precision of an analog to digital converter (ADC) may be controlled (by changing the sampling frequency of the ADC), such that the ADC has a lower precision during a passive audio monitoring state and a higher precision for an active audio logging state, to reduce power consumption or memory usage.
- An exemplary device includes a processor coupled to a clock signal generator, a power controller and an audio detector.
- the power controller may be configured to control a clock rate provided to the processor by the clock signal generator, to control the device to operate in a low power mode having a relatively low power consumption or in a normal power mode having a relatively high power consumption.
- the audio detector is configured to receive audio signals and to detect, in the low power mode, probable presence of a predetermined audio signal in the audio signals.
- the power controller controls the device to switch from the low power mode to the normal power mode responsive to the detected presence of the predetermined audio signal by the audio detector.
- Exemplary devices and methods embodying the present invention include audio detection in a low power mode. Under the low power mode, a clock rate provided to a processor of the device is lower than during a normal power mode. The lower clock rate may be provided to other peripheral components of the device, as well as to the audio detector.
- An exemplary audio detector may detect the probable presence of a predetermined audio signal, based on some aspects of the audio signal. Example embodiments of an audio detector may include more advanced processing than a simple noise gate. Example embodiments of the audio detector may also include more limited processing than conventional audio recognition techniques (such as identification of a keyword). Because exemplary audio detectors may not identify all aspects of the predetermined audio signal, they may have a reduced detection accuracy as compared with audio processing performed during a normal power mode.
- the device may provide more than one level of audio processing, with the audio detector detecting, in the low power mode, the probable presence of the predetermined signal and a DSP detecting, in the normal power mode, the predetermined signal.
- the audio detector may perform detection with a lower accuracy with reduced power consumption (under the low power mode) while the DSP may perform higher accuracy detection with higher power consumption (under the normal power mode), responsive to the audio detector.
- a difference between audio detection of the present invention and conventional full processing of audio is that, with the present invention, when the device is in an idle state (that is, before a start of audio of interest), the device can be in a low power mode.
- a difference between low-power audio detection and other techniques (such as noise gating) to mark the start of audio of interest is that low-power audio detection may provide better selectivity (i.e., better detection accuracy) for triggers while running in a low power mode.
- exemplary audio detectors may use significantly lower power (at least an order of magnitude) than other audio detectors and may be less likely to miss triggers than noise gates.
- One audio detection system includes a wireless headset and a mobile phone.
- the system may use direct user input (a button press) on the wireless headset to initiate detection of voice commands. Once the user input is received, audio from the headset may be routed to the mobile phone for voice processing. If voice commands were to be recognized by this conventional system using voice activation (instead of by direct user input), one way to do so would be by initiating a full wireless connection (such as BluetoothTM), routing all of the audio to the mobile phone and performing voice processing on the phone. Not only does this consume power in an application processor on the mobile phone and in ADCs on the headset, but it consumes power in the Bluetooth chip on the phone and the Bluetooth chip on the headset. Accordingly, this technique may result in poor battery life, especially on the headset.
- direct user input a button press
- the mobile phone can go to sleep completely and the headset can put its Bluetooth link into a lower power mode until the keyword is detected. If the main processor of the headset performs the keyword detection in the normal power mode, however, the power consumption still does not produce an adequate stand-by time on the headset. If, however, low power audio detection techniques are performed by the headset (in accordance with aspects of the present invention), the power consumption of the headset may be reduced, thus increasing the stand-by time of the headset.
- Device 100 may include microphone 102 , audio detector 104 , general processor 106 , digital signal processor (DSP) 110 , power controller 112 , clock signal generator 114 and storage device 122 .
- Device 100 may include other functional components, such as, without being limited to, optional transmitter 124 , optional receiver 126 and optional antenna 128 .
- General processor 106 and storage device 122 may be coupled to audio detector 104 , DSP 110 , power controller 112 , clock signal generator 114 , optional transmitter 124 , optional receiver 126 and/or optional antenna 128 via a data and control bus (not shown).
- Device 100 may include any device having a limited power supply capable of detecting a predetermined audio signal. Examples of device 100 may include, without being limited to, a wireless headset, a mobile phone, a personal digital assistant (PDA), a computer, a television, a remote control, an in-car entertainment center, an AM/FM radio, a clock or a watch.
- PDA personal digital assistant
- Device 100 may be configured to operate in a low power mode or in a normal power mode based on a clock rate of clock signal generator 114 . Selection of a power mode may be controlled by power controller 112 , according to detection of a predetermined audio signal in audio signals 130 by audio detector 104 .
- the predetermined audio signal may include, for example, a predetermined voice signal or a predetermined non-voice audio signal (e.g., a whistle, a clap, a click, etc.).
- audio detector 104 may perform audio detection on audio signals 130 while device 100 is in the low power mode.
- power controller 112 may switch device 100 to operate in the normal power mode.
- audio processing by audio detector 104 in the low power mode may cause device 100 to consume less current than if device 100 were operated in the normal power mode.
- Microphone 102 may capture audio signals 130 from a surrounding environment.
- microphone 102 may include an analog microphone, such that audio signals 130 may represent an analog signal.
- microphone 102 may include a digital microphone, such that audio signals 130 may represent a digital signal.
- microphone 102 may include an analog to digital convertor (ADC) (not shown) to produce the digital signal.
- ADC analog to digital convertor
- Audio signals 130 may be provided to at least one of audio detector 104 , general processor 106 or DSP 110 . Audio signals 130 may also be stored in storage device 122 , described further below.
- Audio detector 104 may receive audio signals 130 and may detect the predetermined audio signal in audio signals 130 , to generate detection signal 132 . Detection signal 132 may be provided to power controller 112 . Audio detector 104 may perform audio detection while device 100 is in the low power mode. Audio detection may be performed continuously or periodically during the low power mode. Audio detector 104 is described further below with respect to FIGS. 2 and 3 . Audio detector 104 may include, for example, a logic circuit, a digital signal processor or a microprocessor.
- audio detector 104 may perform some audio processing of audio signals 130 , based on a comparison of audio signals 130 to a predetermined audio signal. Audio detector 104 may provide more processing capability than a noise gate, but may not provide the detection accuracy of processing performed under the normal power mode (for example, as may be performed by DSP 110 ).
- Detection accuracy of audio detector 104 may be controlled based on a clock rate of clock signal 136 provided to audio detector 104 (described further below). According to an exemplary embodiment, audio detector 104 may have sufficient accuracy to detect probable presence of the predetermined audio signal in audio signals 130 . Audio detector 104 , however, may not be able to detect all aspects of the predetermined audio signal. For example, audio detector 104 may detect the probable presence of a voice signal, but may not be able to identify keywords in the voice signal.
- Audio detector 104 may process an analog signal and/or a digital signal. According to an example embodiment, audio detector 104 may process a digital signal (e.g., from microphone 102 configured as a digital microphone) which includes a user's voice.
- the clock rate e.g., 32 kHz
- clock signal 136 provided to audio detector 104 in the low power mode may be too low for full voice reconstruction of the digital signal. Audio detector 104 , however, may still recover aspects of audio signals 130 which may be useful for determining the probable presence of the user's voice.
- General processor 106 may perform general functions related to the operation of device 100 .
- General processor 106 may not be optimized for power consumption when performing any particular task (such as audio signal processing).
- general processor 106 may have some audio signal processing capabilities (including capabilities greater than a noise gate), but may not be optimized for signal processing (such as DSP 110 ).
- General processor 106 may also be configured to perform audio signal processing at a lower clock rate (during the low power mode).
- General processor 106 may control operation of one or more of microphone 102 , audio detector 104 , DSP 110 , power controller 112 , clock circuit 114 , storage device 122 , optional transmitter 124 , optional receiver 126 and optional antenna 128 .
- General processor 106 may include, for example, a logic circuit, a digital signal processor, a microcontroller or a microprocessor. According to an example embodiment, general processor 106 may include, without being limited to, an Intel 8051 processor.
- DSP 110 may be optimized for a specific task (such as audio signal processing), and that optimization may reduce the power consumption for performing that task (in comparison to general processor 106 ).
- DSP 110 may include any suitable digital signal processor capable of performing audio signal processing.
- DSP 110 in general, may analyze a spectrum of audio signals 130 to determine whether the predetermined audio signal is present.
- DSP 100 may perform any suitable audio recognition technique (such as voice recognition using hidden Markov models (HMMs)) or neural networks), as known by one of skill in the art.
- a detection accuracy of DSP 110 may be configured to be higher than a detection accuracy of audio detector 104 .
- DSP 110 may perform subsequent processing of audio signals 130 (e.g., with higher accuracy), after audio detector 104 detects the probable presence of the predetermined audio signal (in the low power mode). Subsequent detection of the predetermined audio signal by DSP 110 (after initial detection by audio detector 104 ) may be used by power controller 112 to fully power up device 100 in the normal power mode. In this manner, device 100 may provide multiple levels of processing of audio signals 130 to detect the predetermined audio signal, and to control power consumption in device 100 .
- audio detector 104 may be a separate component from general processor 106 . According to another example embodiment, audio detector 104 may be part of general processor 106 (e.g., implemented as software running on general processor 106 ), as indicated by dashed box 108 .
- Power controller 112 may receive detection signal 132 from audio detector 104 and may provide control signal 134 to clock signal generator 114 . Control signal 134 of power controller 112 is used switch operation of device 100 between the low power mode and the normal power mode.
- Clock signal generator 114 is configured to produce a first clock 118 and a second clock 120 . It may also include a switch 116 .
- First clock 118 is a relatively higher accuracy clock signal (with a higher clock rate) whereas second clock 120 is a lower accuracy clock signal (with a lower clock rate) which causes the devices to which it is applied to consume less power than first clock 120 .
- clock signal generator 114 Responsive to control signal 134 from power controller 112 , clock signal generator 114 provides clock signal 136 to audio detector 104 , general processor 106 , DSP 110 , optional transmitter 124 and optional receiver 126 .
- first clock 118 has a higher accuracy than second clock 120
- running audio detector 104 (as well as general processor 106 ) with second clock 120 (in low power mode) may provide less accurate audio detection results than running DSP 110 with first clock 118 (in normal power mode).
- First and second clocks 118 and 120 may be configured in various ways. As one example, first clock 118 may be run from a crystal oscillator and second clock 120 may be run from an oscillator on silicon (e.g. an astable multivibrator or a buffer-ring oscillator).
- Power controller 112 provides control signal 134 to clock signal generator 114 so as to control which one of clocks 118 and 120 is used at any time.
- Power controller 134 is configured so that when device 100 is in the low power mode, the lower power clock signal (second clock 120 ) is used. When device 100 is in the normal power mode, the higher power clock signal (first clock 118 ) is used.
- switch 116 may be set so that first clock 118 is active.
- power controller 112 may set switch 116 so that second clock 120 is active.
- Power controller 112 may also deactivate various components of device 100 in the low power mode, such as DSP 110 .
- Device 100 may include storage device 122 .
- Storage device 122 may store at least a portion of audio signals 130 .
- Storage device 122 may also store one or more predetermined audio signals 214 ( FIG. 2 ), one or more values from audio detector 104 , general processor 106 , DSP 110 , power controller 112 , optional transmitter 124 , optional receiver 126 and/or optional antenna 128 .
- Storage device 122 may include, for example, a RAM, volatile memory, non-volatile memory, a magnetic disk, an optical disk, flash memory or a hard drive. Items such as look up tables may be stored in flash memory or read only memory (ROM). These may be embedded or low power versions dedicated for this purpose. Similarly, some volatile, but low power hardware, possibly flip flops, may be used for storage in this mode.
- storage device 122 may store a portion of audio signals 130 (used by audio detector 104 for initial detection). The stored portion may be used by at least one subsequent processing stage (such as DSP 110 or a later processing stage of audio detector 104 ). If the subsequent stage powers up quickly, the amount of storage may be small enough to be both power and cost efficient. For example, if the subsequent stage powers up in 10 ms, then 160 samples of storage may be used to store an 8 kHz audio signal 130 .
- audio signals 130 may be available to subsequent stage(s) (via storage device 122 ), at least one of the earlier processing stages may not need to be extremely selective (i.e., have a high detection accuracy). For example, a moderate false positive detection rate (e.g., by audio detector 104 ) may be filtered out at a later stage (such as by DSP 110 ).
- the storage of audio signals 130 may also, for example, allow later stage(s) to distinguish between multiple detection triggers while simultaneously allowing earlier stage(s) not to distinguish between these triggers.
- an early stage such as audio detector 104
- a later stage such as DSP 110
- Device 100 may include one or more of optional transmitters 124 which convert signals into a format appropriate for transmission from optional antenna 128 or optional receivers 126 which convert radio signals into a suitable format received from optional antenna 128 .
- Device 100 may include other functional components (not shown), such as a power supply, an amplifier and/or a filter. These components may also have different operating characteristics when in the low power mode compared with the normal power mode. For example, amplifiers could be run in a lower current consumption mode in the low power mode. According to another example, clock references may have laxer tolerances in the low power mode (for example, an R-C clock might be sufficient in the low power mode, so that the crystals may be powered down). Examples of these techniques are described in U.S. Patent App. Pub. No. US 2011/0065413 to Singer.
- FIG. 1B a functional block diagram of an example device 100 ′ is shown, according to another embodiment of the present invention.
- Device 100 ′ is similar to device 100 ( FIG. 1A ), except that audio detector 104 in device 100 ′ is clocked by clock signal 142 of auxiliary clock signal generator 140 .
- audio detector 104 may be clocked separately from the rest of components of device 100 ′. Audio detector 104 may also be powered independently of the other components of device 100 ′. Thus, audio detector 104 may reduce the processing power required by, and thus current consumed by, other components of device 100 ′.
- audio detector 104 components of one or more of audio detector 104 , general processor 106 , power controller 112 , clock signal generator 114 and auxiliary clock signal generator 140 may be implemented in hardware or a combination of hardware and software.
- microphone 102 , audio detector 104 , general processor 106 , DSP 110 , power controller 112 , clock signal generator 114 , storage device 122 , optional transmitter 124 , optional receiver 126 , optional antenna 118 and auxiliary clock signal generator 140 are illustrated as part of one system (for example, formed on a single chip), various components of device 100 (and device 100 ′) may be formed separately.
- hardware and/or software components of devices 100 , 100 ′ may be selected according to numerous factors, such as a desired power consumption and/or a desired materials cost.
- aspects of the present invention are implemented on existing hardware which already includes a low power (i.e., low clock rate) microprocessor (i.e., general processor 106 ), additional components (such as audio detector 104 and power controller 112 ) may have to be added (such as from discrete components) to the hardware. This may increase the number of components and a required area of a printed circuit board (PCB).
- PCB printed circuit board
- aspects of the present invention are implemented as part of a new application-specific integrated circuit (ASIC)
- ASIC application-specific integrated circuit
- an increase in cost for adding some analog processing components may be marginal.
- These analog components may provide some simple processing (such as a noise gate) at lower power consumption than processing by a microprocessor.
- the analog components may occupy a smaller chip area than the chip area used to support extra ROM and/or RAM to extend the microprocessor's program and storage (to perform the audio detection processing).
- an ADC may consume a substantial amount of power.
- a noise gate implemented in a microprocessor on an existing system may also require continual use of an ADC.
- a noise gate implemented with analog components may allow the ADC to be switched off until the input is determined to be sufficiently interesting (i.e., above a threshold).
- Audio detector 104 may include comparator 208 . Audio detector 104 may also include one or more optional components such as analog to digital converter (ADC) 202 , filter 204 (also referred to herein as filter(s) 204 ) and/or level trigger 206 .
- ADC analog to digital converter
- comparator 208 may receive audio signals 130 and may generate detection signal 132 .
- comparator 208 may compare audio signals 130 to a predetermined audio signal 214 (also referred to herein as predetermined audio signal(s) 214 ) to generate detection signal 132 .
- predetermined audio signal 214 also referred to herein as predetermined audio signal(s) 214
- comparator 208 may compare frequency components of audio signals 130 with predetermined audio signal(s) 214 , to detect the probable presence of predetermined audio signal(s) 214 . Comparator 208 is described further below with respect to FIG. 3 .
- audio signals 130 may include an analog signal or a digital signal.
- comparator 208 may be configured to process audio signals 130 in the analog domain and/or in the digital domain.
- audio detector 104 may include two or more comparators 208 .
- each comparator 208 may provide different detection accuracy.
- each comparator 208 may provide different levels of comparison. Examples of comparison may include: whether the audio signal contains voice signals compared to non-voice signals; whether the audio contains a user's voice (or one of a set of users' voices) compared to other voices; or whether the audio contains specific keywords compared to other noises produced by the user.
- predetermined audio signal(s) 214 may also include predetermined non-voice signals, such as, without being limited to, a whistle, a clap or a click.
- Audio detector 104 may include optional ADC 202 .
- Optional ADC 202 may receive audio signals 130 as an analog signal, and may convert audio signals 130 to a digital signal.
- ADC 202 may provide a digital signal to comparator 208 (or to optional filter(s) 204 or to optional level trigger 206 ).
- comparator 208 or to optional filter(s) 204 or to optional level trigger 206 .
- ADC 202 in the low power mode, ADC 202 may operate with a lower accuracy clock (such as using second clock 120 shown in FIG. 1A ) or at a lower frequency than during the normal power mode.
- Audio detector 104 may include optional filter(s) 204 .
- Filter(s) 204 may receive audio signals 130 (or a digitized signal from optional ADC 202 ) and provide a filtered signal to comparator 208 (or to optional level trigger 206 ).
- Optional filter(s) 204 may be configured with filter parameter(s) 210 .
- Optional filter(s) 204 may include any suitable analog domain or frequency domain filters, such as, low pass filters, high pass filters, band pass filters, notch filters, or any combination thereof.
- optional filter(s) 204 may include a high pass filter, to attenuate a direct current (DC) component, for reducing false positive audio detection.
- optional filter(s) 204 may include a band pass filter to pass a range of frequencies corresponding to voice (for example, between about 50 Hz and about 4 kHz).
- Audio detector 104 may include optional level trigger 206 .
- Optional level trigger 206 may receive audio signals 130 (or a digitized signal from optional ADC 202 or a filtered signal from optional filter(s) 204 ) and may provide a trigger signal to comparator 208 .
- Optional level trigger 206 may compare a level of audio signals 130 to optional noise gate threshold 212 . If the level of audio signals 130 is greater than optional noise gate threshold 212 , optional level trigger 206 may trigger comparator 208 to analyze audio signals 130 . Otherwise, comparator 208 may not analyze audio signals 130 . Thus, optional level trigger 206 may operate as a noise gate.
- optional level trigger 206 may receive the analog signal and generate a noise-gated signal.
- the noise-gated signal may be provided to comparator 208 for analysis.
- comparator 208 may be able to obtain, effectively a one bit per sample audio signal for processing.
- device 100 may include storage device 122 , which may store at least a portion of audio signals 130 .
- Storage of audio signals 130 may be controlled during different stages of audio detector 104 .
- storage may be non-volatile and may not be active unless optional level trigger 206 provides a trigger signal to comparator 208 . This could allow storage device 122 ( FIG. 1A ) to be powered off for the majority of the lifetime of device 100 (in the low power mode).
- audio detector 104 may include a microprocessor, which may perform the processing during the low power mode (with low power components). It may be desirable to run audio detector 104 independently from general processor 106 ( FIG. 1A ) of device.
- general processor 106 In the low power mode, general processor 106 ( FIG. 1A ) may be configured into a low leakage current state, by placing its RAMs into a low voltage data retention state. In this state, the RAMs of general processor 106 ( FIG. 1A ) may not be accessed.
- audio detector 104 e.g., a microprocessor
- general processor 106 FIG. 1A
- general processor 106 may be powered off completely (losing its RAM contents but saving power).
- General processor 106 FIG. 1A
- NVRAM non-volatile RAM
- audio detector 104 may be formed from passive components. According to another example embodiment, one or more components of audio detector may be adjusted. For example, at least one component may be adjusted (adapted) responsive to changes in environmental noise conditions. According to another example embodiment, one or more components of audio detector may be trained to detect predetermined audio signal(s) 214 under various noise conditions. According to a further exemplary embodiment, one or more components of audio detector may be capable of learning new predetermined audio signal(s) 214 and/or new noise conditions.
- Adjustment of at least one of optional filter parameter(s) 210 , optional noise gate threshold 212 , predetermined audio signal(s) 214 and comparator 208 is generally indicated by respective optional control signals 216 - 1 , 216 - 2 , 216 - 3 and 216 - 4 .
- Control signals 216 may be provided, for example, by general processor 106 ( FIG. 1A ).
- audio detector 104 may attempt to find filter bank parameters 312 ( FIG. 3 ) of comparator 208 (via control signal 216 - 4 ) that identify different parts of a keyword with good selectivity.
- audio detector 104 via control signal 216 - 1 ) may alter optional filter parameter(s) 210 away from ideal settings for a noise-free environment to reduce noise degradation of audio signals 130 .
- audio detector 104 via control signal 216 - 2 ) may alter optional noise gate threshold 212 away from ideal settings for the noise free environment to reduce false positive triggering by optional level trigger 206 .
- the adaptability of audio detector 104 may be selected to target a particular ratio of wake-ups (i.e., switching to the normal power mode) being, true positives or a particular minimum wake-up rate when using non-ideal settings (e.g., for noisy environments).
- audio detector 104 may be adapted to react to false positives. According to another example embodiment, audio detector 104 may be adapted to compensate for false positives and false negatives. For example, audio detector 104 may alter thresholds and/or other parameters to reduce false positives. Over time, unfortunately, audio detector 104 may reduce the number of false positives while gradually becoming less sensitive to the true positives. With a multi-stage audio detector, if the first stage rejects too many signals, there may be no way to identify false negatives without user interaction. However, if the first stage (such as optional level trigger 206 or one stage of comparator 208 ) allows some false positives through, later stages can use these false positives to ensure that audio detector 104 does not become insensitive to true positives. Audio detector 104 may also allow some target levels of false positives to ensure no or few false negatives.
- one or more components of audio detector 104 may wake up periodically to sample the background noise and/or to adjust filter parameters or other parameters of audio detector 104 .
- device 100 may determine the background noise level and adjust noise gate threshold 212 to be just above the background noise level, effectively generating a rolling average estimate of the current background noise level.
- periodic wake up of components of device 100 may be expensive in terms of power, it may be possible to suppress the wake up when it is known that the environment is quiet. For example, at night the user may typically leave device 100 in a quiet area. Device 100 may set noise gate threshold 212 to a relatively low value and turn off periodic environmental noise adaptation. Device 100 may, thus, be confident that any change in the environment may cause optional level trigger 206 to provide a trigger signal for initial audio detection.
- audio detector 104 may wake up the full device 100 ( FIG. 1A ) in response to a user's trigger; and may also wake up the full device 100 in response to change in environment.
- This double triggering may be generalized.
- the high power mode components of device 100 may teach the low power mode components to wake it up either for a trigger or for a change in the environment.
- Adaptability of audio detector 104 may be assisted by storing of audio signals 130 (such as in storage device 122 of FIG. 1A ) during operation in the low power mode. This may allow the full device 100 ( FIG. 1A ), in the normal power mode, to determine the exact signal that caused triggering of audio detector 104 (in the low power mode). For example, this signal may be applied to a model of the low power circuit with varying parameters to determine new parameters for audio detector 104 .
- parameters of audio detector 104 may be kept constant when device 100 ( FIG. 1A ) is in the low power mode. If adaptation is desired, device 100 may be brought into the normal power mode. Device 100 ( FIG. 1A ) (in the normal power mode) may then determine new parameters, load them into audio detector 104 and return to the low power mode.
- sufficiently sophisticated components of audio detector 104 may be capable of being adapted while remaining in the low power mode (i.e., without switching to the normal power mode as described above).
- audio detector 104 may be able to adapt an initial noise gate threshold 212 while remaining in the low power mode but may switch to the normal power mode to identify a persistent background noise and calculate settings for components of audio detector 104 that may suppress the background noise.
- Audio detector 104 may be capable of being adapted according to other techniques. For example, audio detector 104 may examine a new portion of audio signals 130 after comparator 208 is triggered by optional level trigger 206 , to adjust parameters of audio detector 104 .
- device 100 may assume that the new portion of audio signals 130 is similar to the signal that caused triggering of level trigger 206 .
- Storage device 122 FIG. 1A
- 10 ms of storage may not be of sufficient duration to store a whole keyword trigger. For an entire keyword, it may be desirable to store about 1 to 2 seconds of audio signals 130 . In general, it may be desirable to store between about 10 ms to about 2 seconds of audio signals 130 . More preferably, it may be desirable to store about 100 ms of audio signals 130 . For example, a 100 ms duration may be sufficient to detect that the user is speaking but not the specific word. A 100 ms duration may be long enough to identify a phoneme or, more specifically, that the user is probably speaking the first phoneme of a keyword. If device 100 ( FIG. 1A ) records, for example, 8 bit samples at 4 kHz during that time, only 800 bytes of storage may be needed. With 1 kB of storage, device 100 may be able to increase sampling of any ADCs up to 16 bit samples at 16 kHz while a next stage gets ready for audio detection.
- Comparator 208 may include filter bank 302 , wideband signal detector 304 , narrowband signal detector 306 , storage device 308 and pattern comparator 310 .
- Filter bank 302 may receive audio signals 130 and may apply a plurality of filters to audio signals 130 , according to one or more filter bank parameters 312 (referred to herein as filter bank parameter(s) 312 ).
- Filter bank 302 may include any suitable analog domain or frequency domain filters, such as, low pass filters, high pass filters, band pass filters, notch filters, or any combination thereof.
- filter bank 302 may filter audio signals 130 into three frequency bands, such as a low frequency band, a mid-frequency band and a high frequency band corresponding to frequencies associated with a user's voice (e.g., audio of interest).
- filter bank parameter(s) 312 of filter bank 302 may represent frequencies indicative of a probable presence of predetermined audio signal(s) 214 in audio signals 130 .
- Filter bank parameter(s) 312 may represent filter parameters for filter banks corresponding to a number of different predetermined audio signals 214 . Selection of filter bank parameter(s) 312 may be controlled, for example, by control signal 314 - 1 . Thus, filter bank 302 may be adjusted to detect a number of different predetermined audio signals 214 (such as a number of different voices).
- a plurality of filtered signals from filter bank 302 may be provided to wideband signal detector 304 and narrowband signal detector 306 .
- Wideband detector 304 may analyze a variation in the filtered signals over a wide range of frequencies whereas narrowband detector 306 may analyze a variation in the filtered signals over a narrow range of frequencies.
- Each detector 304 , 306 may compare the analyzed signals to a respective (wideband or narrowband) detection threshold. If the analyzed signals are greater than the respective detection threshold, the corresponding detector may output a respective detection indication.
- voice may contain a mixture of consonants and vowels. Vowels are typically a narrow bandwidth signal (a small range of frequencies), whereas consonants are a wide bandwidth signal (a large range of frequencies).
- Each detector 304 , 306 may simultaneously perform the respective analysis over time. Accordingly, over time, the outputs of detectors 304 and 306 may indicate a pattern of wideband and narrowband signals.
- the detection thresholds and other parameters of wideband signal detector 304 and narrowband signal detector 306 may be adjusted, for example, by respective control signals 314 - 2 and 314 - 3 .
- detectors 304 and 306 may be adjusted to correspond to a number of different predetermined audio signals 214 .
- wideband signal detector 304 and narrowband signal detector 306 are shown in FIG. 3 , in general, any suitable number of detectors may be used to detect a variation over time in the filtered signals (from filter bank 302 ) over one or more frequency bands. For example, a number of narrowband signal detectors 306 may analyze a variation in the power in different frequency bands over time.
- detectors 304 and 306 may perform the frequency analysis using any suitable technique, such as, without being limited to, a fast Fourier transform (FFT) in the frequency domain, or techniques in the analog domain. Variations in specific frequencies may be used to identify whether it is likely that predetermined audio signal(s) 214 is in audio signals 130 .
- FFT fast Fourier transform
- Storage device 308 may receive and store the detection results from detectors 304 and 306 over a period of time, as a detected pattern.
- Storage device 308 may include, for example, a shift register, a random access memory (RAM), a magnetic disk, an optical disk, flash memory or a hard drive.
- RAM random access memory
- Pattern comparator 310 may receive the detected pattern stored in storage device 308 . The detected pattern may be compared to predetermined audio signal(s) 214 . If the detected pattern is substantially similar to predetermined audio signal(s) 214 , pattern comparator 310 may indicate the detected presence of predetermined audio signal 214 , by detection signal 132 .
- pattern comparator 310 may analyze a mix of wideband and narrowband signals (from the detected pattern) at time intervals consistent with predetermined spoken words. It is understood that careful choice of keywords (such as multi-syllable keywords) to wake-up device 100 ( FIG. 1A ) may improve the audio detection accuracy.
- keywords such as multi-syllable keywords
- Parameters of pattern comparator 310 may be adjusted, for example, by control signal 314 - 4 .
- a detection accuracy of pattern comparator 310 may be adjusted.
- one or more components of comparator 208 may be adjusted, for example, responsive to changes in environmental noise conditions.
- one or more components of comparator 208 may be trained to detect predetermined audio signal(s) 214 under various noise conditions.
- one or more components of comparator 208 may be capable of learning new predetermined audio signal(s) 214 and/or new noise conditions. Adjustment of comparator 208 is generally indicated by respective optional control signals 314 - 1 , 314 - 2 , 314 - 3 and 314 - 4 .
- Control signals 314 may be provided, for example, by general processor 106 ( FIG. 1A ).
- audio detector 104 may be configured to learn new keywords. A user may be asked to repeat a new keyword so that audio detector 104 can learn and store the new keyword. Repeated unsuccessful attempts to learn the new keyword may cause comparator 208 (and/or other optional components of audio detector 104 ) to adjust one or more of its parameters.
- FIG. 4 a flowchart diagram of an example method of detecting a predetermined audio signal is shown.
- device 100 FIG. 1A
- power controller 112 FIG. 1A
- clock signal generator 114 may control clock signal generator 114 to use second clock 120 (a lower accuracy clock) to provide clock signal 136 to components of device 100 , including general processor 106 .
- audio signals 130 may be filtered, for example, by at least one filter 204 of audio detector 104 ( FIG. 2 ).
- a level of audio signals 130 may be determined, for example, by level trigger 206 of audio detector 104 ( FIG. 2 ).
- optional step 406 may proceed to optional step 408 .
- one or more additional components of audio detector 104 may be powered up.
- audio detector 104 may power up comparator 208 ( FIG. 2 ).
- Optional step 408 may proceed to step 410 .
- optional step 406 may proceed to step 400 .
- One or more of optional steps 402 - 408 may be repeated.
- audio signals 130 are analyzed to detect a probable presence of a predetermined audio signal 214 in audio signals 130 , for example, by comparator 208 of audio detector 104 ( FIG. 2 ).
- step 412 may proceed to optional step 414 .
- DSP 110 of device 100 may be powered up.
- DSP 110 may be powered up and operated at a reduced clock rate, such as by second clock 120 of clock signal generator 114 ( FIG. 1A ).
- Optional step 414 may proceed to optional step 416 .
- audio signals 130 may be stored (for example, in storage device 122 ( FIG. 1A )) or predetermined audio signal 214 may be repeated by the user (to confirm that predetermined audio signal 214 was indeed indicated).
- step 412 may proceed to step 400 .
- audio signals 130 are analyzed to detect the probable presence of predetermined audio signal 214 in audio signals 130 , for example, by DSP 110 at a reduced clock rate ( FIG. 1A ).
- optional step 418 may proceed to optional step 420 .
- DSP 110 of device 100 may be powered up and operated at a higher clock rate, such as by first clock 118 of clock signal generator 114 .
- Optional step 420 may proceed to optional step 422 .
- step 418 may proceed to step 400 .
- audio signals 130 are analyzed to detect the probable presence of predetermined audio signal 214 in audio signals 130 , for example, by DSP 110 at the higher clock rate ( FIG. 1A ).
- step 424 may proceed to step 426 .
- device 100 may be switched to the normal power mode.
- power controller 112 may control clock signal generator 114 to use first clock 118 (a higher accuracy clock) to provide clock signal 136 to components of device 100 , including general processor 106 .
- step 424 may proceed to step 400 .
- Steps 400 - 424 may be continuously or periodically repeated until predetermined audio signal 214 is detected.
- steps 410 - 412 more advanced audio processing capability
- steps 402 - 408 reduced audio processing capability
- optional steps 414 - 424 most advanced audio processing capability, such as voice recognition processing with HMMs
- one or more products may be implemented in software on microprocessors/general purpose computers (not shown).
- one or more of the functions of the various components may be implemented in software that controls a general purpose computer.
- This software may be embodied in a non-transitory computer readable medium, for example, RAM, a magnetic or optical disk or a memory-card.
Landscapes
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- General Health & Medical Sciences (AREA)
- Otolaryngology (AREA)
- Computer Networks & Wireless Communication (AREA)
- Telephone Function (AREA)
- Electric Clocks (AREA)
Abstract
Description
- This application claims priority to U.S. Provisional Application Ser. No. 61/603,717, entitled “LOW POWER AUDIO DETECTION,” filed Feb. 27, 2012, incorporated fully herein by reference.
- The present invention is directed generally to reducing power consumption in devices, and, more particularly, to devices and methods for detecting probable presence of a predetermined audio signal in audio signals while reducing power consumption in a device.
- Various devices have a limited energy supply, such as those that are powered by batteries. Some devices exist which may respond to voice commands or other occasional predetermined sounds (generally referred to herein as audio of interest). In general, devices may process an audio signal to detect any audio of interest. Most of the time, however, there is no audio of interest present in the audio signal. Furthermore, processing of the audio signal may cause the device to consume current, thereby increasing a power consumption in the device. The audio signal processing, thus, may limit a battery lifetime (notably a stand-by time) of the device.
- The present invention is embodied in devices and methods of detecting a predetermined audio signal in audio signals. A device includes a processor coupled to a clock signal generator, a power controller and an audio detector. The power controller is configured to control a clock rate provided to the processor by the clock signal generator, to control the device to operate in a low power mode having a relatively low power consumption or in a normal power mode having a relatively high power consumption. The audio detector is coupled to the power controller. The audio detector is configured to receive audio signals and to detect, in the low power mode, probable presence of a predetermined audio signal in the audio signals. The power controller controls the device to switch from the low power mode to the normal power mode responsive to the detected presence of the predetermined audio signal by the audio detector.
- The invention may be understood from the following detailed description when read in connection with the accompanying drawing. It is emphasized, according to common practice, that various features of the drawing may not be to scale. On the contrary, the dimensions of the various features may be arbitrarily expanded or reduced for clarity. Moreover, in the drawing, common numerical references are used to represent like features. Included in the drawing are the following figures:
-
FIG. 1A is a functional block diagram of a device which detects a predetermined audio signal, according to an embodiment of the present invention; -
FIG. 1B is a functional block diagram of a device which detects a predetermined audio signal, according to another embodiment of the present invention; -
FIG. 2 is a functional block diagram of an audio detector of the devices shown inFIGS. 1A and 1B , according to an embodiment of the present invention; -
FIG. 3 is a functional block diagram of a comparator of the audio detector shown inFIG. 2 , according to an embodiment of the present invention; and -
FIG. 4 is a flowchart diagram of a method of detecting a predetermined audio signal, according to an embodiment of the present invention. - As discussed above, conventional devices may process an audio signal to detect audio of interest. Devices may, for example, use conventional voice recognition techniques to continually process the audio signal for audio of interest. These techniques, however, may result in relatively high power consumption. One alternative technique may be to periodically process a small burst of audio. For example, 10 ms of audio may be sampled every 100 ms to determine whether any audio of interest is present.
- Other techniques that may be used to indicate the start of audio of interest include direct input by a user to an input component of the device, such as a push-button. However, this may require that the device be accessible to a user and that it be equipped with a suitable input component. Furthermore, button presses may interrupt a smooth user experience. As another example, some devices may use a simple electronic threshold detection (i.e., a noise gate) to indicate the start of audio of interest. A simple noise gate, however, may provide too many false positive results in noisy environments and too many false negative results in quiet environments.
- Various devices may include a low power mode and a normal power mode. In the low power mode, the energy consumption is typically reduced (compared to the normal power mode) by disabling some of the functions of the device. The low power mode may be useful, for example, for battery-powered devices.
- One audio detection technique (such as voice recognition or periodic processing of small bursts of audio) may use a normal power mode processing capability of the system. For example, voice recognition techniques typically involve a digital signal processor (DSP) capable of identifying keywords in an audio signal. Continual use of the DSP may involve higher power consumption in the device. Periodic processing of small bursts of audio may also involve waking up significant parts of the system that aren't involved in audio processing, for example, one or more application processors, a general purpose random access memory (RAM) or wired communication hardware (such as a Universal Asynchronous Receiver-Transmitter (UART), a Universal Serial Bus (USB), a Secure Digital Input Output (SDIO), etc.). These components will consume power while the audio processing is taking place.
- A mobile device may intermittently or continuously detect audio activity, even during an idle mode (where the device is not actively running any application in response to a user's manual input). The device may automatically start and end logging of an audio signal based on detected audio activity. The precision of an analog to digital converter (ADC) may be controlled (by changing the sampling frequency of the ADC), such that the ADC has a lower precision during a passive audio monitoring state and a higher precision for an active audio logging state, to reduce power consumption or memory usage.
- Aspects of embodiments of the present invention relate to devices and methods for detecting probable presence of a predetermined audio signal (i.e., audio of interest) in audio signals. An exemplary device includes a processor coupled to a clock signal generator, a power controller and an audio detector. The power controller may be configured to control a clock rate provided to the processor by the clock signal generator, to control the device to operate in a low power mode having a relatively low power consumption or in a normal power mode having a relatively high power consumption. The audio detector is configured to receive audio signals and to detect, in the low power mode, probable presence of a predetermined audio signal in the audio signals. The power controller controls the device to switch from the low power mode to the normal power mode responsive to the detected presence of the predetermined audio signal by the audio detector.
- Exemplary devices and methods embodying the present invention include audio detection in a low power mode. Under the low power mode, a clock rate provided to a processor of the device is lower than during a normal power mode. The lower clock rate may be provided to other peripheral components of the device, as well as to the audio detector. An exemplary audio detector may detect the probable presence of a predetermined audio signal, based on some aspects of the audio signal. Example embodiments of an audio detector may include more advanced processing than a simple noise gate. Example embodiments of the audio detector may also include more limited processing than conventional audio recognition techniques (such as identification of a keyword). Because exemplary audio detectors may not identify all aspects of the predetermined audio signal, they may have a reduced detection accuracy as compared with audio processing performed during a normal power mode.
- According to an exemplary embodiment, the device may provide more than one level of audio processing, with the audio detector detecting, in the low power mode, the probable presence of the predetermined signal and a DSP detecting, in the normal power mode, the predetermined signal. Thus, the audio detector may perform detection with a lower accuracy with reduced power consumption (under the low power mode) while the DSP may perform higher accuracy detection with higher power consumption (under the normal power mode), responsive to the audio detector.
- A difference between audio detection of the present invention and conventional full processing of audio is that, with the present invention, when the device is in an idle state (that is, before a start of audio of interest), the device can be in a low power mode. A difference between low-power audio detection and other techniques (such as noise gating) to mark the start of audio of interest is that low-power audio detection may provide better selectivity (i.e., better detection accuracy) for triggers while running in a low power mode. In general, exemplary audio detectors may use significantly lower power (at least an order of magnitude) than other audio detectors and may be less likely to miss triggers than noise gates.
- One audio detection system includes a wireless headset and a mobile phone. The system may use direct user input (a button press) on the wireless headset to initiate detection of voice commands. Once the user input is received, audio from the headset may be routed to the mobile phone for voice processing. If voice commands were to be recognized by this conventional system using voice activation (instead of by direct user input), one way to do so would be by initiating a full wireless connection (such as Bluetooth™), routing all of the audio to the mobile phone and performing voice processing on the phone. Not only does this consume power in an application processor on the mobile phone and in ADCs on the headset, but it consumes power in the Bluetooth chip on the phone and the Bluetooth chip on the headset. Accordingly, this technique may result in poor battery life, especially on the headset.
- If, on the other hand, the keyword detection is performed by the headset (in a normal power mode), the mobile phone can go to sleep completely and the headset can put its Bluetooth link into a lower power mode until the keyword is detected. If the main processor of the headset performs the keyword detection in the normal power mode, however, the power consumption still does not produce an adequate stand-by time on the headset. If, however, low power audio detection techniques are performed by the headset (in accordance with aspects of the present invention), the power consumption of the headset may be reduced, thus increasing the stand-by time of the headset.
- Referring to
FIG. 1A , a functional block diagram of anexample device 100 is shown.Device 100 may includemicrophone 102,audio detector 104,general processor 106, digital signal processor (DSP) 110,power controller 112,clock signal generator 114 andstorage device 122.Device 100 may include other functional components, such as, without being limited to,optional transmitter 124,optional receiver 126 andoptional antenna 128.General processor 106 andstorage device 122 may be coupled toaudio detector 104,DSP 110,power controller 112,clock signal generator 114,optional transmitter 124,optional receiver 126 and/oroptional antenna 128 via a data and control bus (not shown). -
Device 100 may include any device having a limited power supply capable of detecting a predetermined audio signal. Examples ofdevice 100 may include, without being limited to, a wireless headset, a mobile phone, a personal digital assistant (PDA), a computer, a television, a remote control, an in-car entertainment center, an AM/FM radio, a clock or a watch. -
Device 100 may be configured to operate in a low power mode or in a normal power mode based on a clock rate ofclock signal generator 114. Selection of a power mode may be controlled bypower controller 112, according to detection of a predetermined audio signal inaudio signals 130 byaudio detector 104. The predetermined audio signal may include, for example, a predetermined voice signal or a predetermined non-voice audio signal (e.g., a whistle, a clap, a click, etc.). - In operation,
audio detector 104 may perform audio detection onaudio signals 130 whiledevice 100 is in the low power mode. When probable presence of a predetermined audio signal (i.e., audio of interest) is detected,power controller 112 may switchdevice 100 to operate in the normal power mode. In general, audio processing byaudio detector 104 in the low power mode may causedevice 100 to consume less current than ifdevice 100 were operated in the normal power mode. -
Microphone 102 may captureaudio signals 130 from a surrounding environment. According to one embodiment,microphone 102 may include an analog microphone, such thataudio signals 130 may represent an analog signal. According to another embodiment,microphone 102 may include a digital microphone, such thataudio signals 130 may represent a digital signal. For example,microphone 102 may include an analog to digital convertor (ADC) (not shown) to produce the digital signal. Audio signals 130 may be provided to at least one ofaudio detector 104,general processor 106 orDSP 110. Audio signals 130 may also be stored instorage device 122, described further below. -
Audio detector 104 may receiveaudio signals 130 and may detect the predetermined audio signal inaudio signals 130, to generatedetection signal 132.Detection signal 132 may be provided topower controller 112.Audio detector 104 may perform audio detection whiledevice 100 is in the low power mode. Audio detection may be performed continuously or periodically during the low power mode.Audio detector 104 is described further below with respect toFIGS. 2 and 3 .Audio detector 104 may include, for example, a logic circuit, a digital signal processor or a microprocessor. - In general,
audio detector 104 may perform some audio processing ofaudio signals 130, based on a comparison ofaudio signals 130 to a predetermined audio signal.Audio detector 104 may provide more processing capability than a noise gate, but may not provide the detection accuracy of processing performed under the normal power mode (for example, as may be performed by DSP 110). - Detection accuracy of
audio detector 104 may be controlled based on a clock rate ofclock signal 136 provided to audio detector 104 (described further below). According to an exemplary embodiment,audio detector 104 may have sufficient accuracy to detect probable presence of the predetermined audio signal in audio signals 130.Audio detector 104, however, may not be able to detect all aspects of the predetermined audio signal. For example,audio detector 104 may detect the probable presence of a voice signal, but may not be able to identify keywords in the voice signal. -
Audio detector 104 may process an analog signal and/or a digital signal. According to an example embodiment,audio detector 104 may process a digital signal (e.g., frommicrophone 102 configured as a digital microphone) which includes a user's voice. The clock rate (e.g., 32 kHz) ofclock signal 136 provided toaudio detector 104 in the low power mode may be too low for full voice reconstruction of the digital signal.Audio detector 104, however, may still recover aspects ofaudio signals 130 which may be useful for determining the probable presence of the user's voice. -
General processor 106 may perform general functions related to the operation ofdevice 100.General processor 106 may not be optimized for power consumption when performing any particular task (such as audio signal processing). In other words,general processor 106 may have some audio signal processing capabilities (including capabilities greater than a noise gate), but may not be optimized for signal processing (such as DSP 110).General processor 106 may also be configured to perform audio signal processing at a lower clock rate (during the low power mode).General processor 106 may control operation of one or more ofmicrophone 102,audio detector 104,DSP 110,power controller 112,clock circuit 114,storage device 122,optional transmitter 124,optional receiver 126 andoptional antenna 128.General processor 106 may include, for example, a logic circuit, a digital signal processor, a microcontroller or a microprocessor. According to an example embodiment,general processor 106 may include, without being limited to, an Intel 8051 processor. - In contrast to
general processor 106,DSP 110 may be optimized for a specific task (such as audio signal processing), and that optimization may reduce the power consumption for performing that task (in comparison to general processor 106).DSP 110 may include any suitable digital signal processor capable of performing audio signal processing.DSP 110, in general, may analyze a spectrum ofaudio signals 130 to determine whether the predetermined audio signal is present.DSP 100 may perform any suitable audio recognition technique (such as voice recognition using hidden Markov models (HMMs)) or neural networks), as known by one of skill in the art. According to an example embodiment, a detection accuracy ofDSP 110 may be configured to be higher than a detection accuracy ofaudio detector 104. - According to an example embodiment,
DSP 110 may perform subsequent processing of audio signals 130 (e.g., with higher accuracy), afteraudio detector 104 detects the probable presence of the predetermined audio signal (in the low power mode). Subsequent detection of the predetermined audio signal by DSP 110 (after initial detection by audio detector 104) may be used bypower controller 112 to fully power updevice 100 in the normal power mode. In this manner,device 100 may provide multiple levels of processing ofaudio signals 130 to detect the predetermined audio signal, and to control power consumption indevice 100. - According to one example embodiment,
audio detector 104 may be a separate component fromgeneral processor 106. According to another example embodiment,audio detector 104 may be part of general processor 106 (e.g., implemented as software running on general processor 106), as indicated by dashedbox 108. -
Power controller 112 may receivedetection signal 132 fromaudio detector 104 and may provide control signal 134 toclock signal generator 114.Control signal 134 ofpower controller 112 is used switch operation ofdevice 100 between the low power mode and the normal power mode. -
Clock signal generator 114 is configured to produce afirst clock 118 and asecond clock 120. It may also include aswitch 116.First clock 118 is a relatively higher accuracy clock signal (with a higher clock rate) whereassecond clock 120 is a lower accuracy clock signal (with a lower clock rate) which causes the devices to which it is applied to consume less power thanfirst clock 120. Responsive to control signal 134 frompower controller 112,clock signal generator 114 providesclock signal 136 toaudio detector 104,general processor 106,DSP 110,optional transmitter 124 andoptional receiver 126. - Because
first clock 118 has a higher accuracy thansecond clock 120, running audio detector 104 (as well as general processor 106) with second clock 120 (in low power mode) may provide less accurate audio detection results than runningDSP 110 with first clock 118 (in normal power mode). First andsecond clocks first clock 118 may be run from a crystal oscillator andsecond clock 120 may be run from an oscillator on silicon (e.g. an astable multivibrator or a buffer-ring oscillator). -
Power controller 112 providescontrol signal 134 toclock signal generator 114 so as to control which one ofclocks Power controller 134 is configured so that whendevice 100 is in the low power mode, the lower power clock signal (second clock 120) is used. Whendevice 100 is in the normal power mode, the higher power clock signal (first clock 118) is used. - In the normal power mode, all components of
device 100 may be active and switch 116 may be set so thatfirst clock 118 is active. In the low power mode,power controller 112 may setswitch 116 so thatsecond clock 120 is active.Power controller 112 may also deactivate various components ofdevice 100 in the low power mode, such asDSP 110. -
Device 100 may includestorage device 122.Storage device 122 may store at least a portion of audio signals 130.Storage device 122 may also store one or more predetermined audio signals 214 (FIG. 2 ), one or more values fromaudio detector 104,general processor 106,DSP 110,power controller 112,optional transmitter 124,optional receiver 126 and/oroptional antenna 128.Storage device 122 may include, for example, a RAM, volatile memory, non-volatile memory, a magnetic disk, an optical disk, flash memory or a hard drive. Items such as look up tables may be stored in flash memory or read only memory (ROM). These may be embedded or low power versions dedicated for this purpose. Similarly, some volatile, but low power hardware, possibly flip flops, may be used for storage in this mode. - According to an example embodiment,
storage device 122 may store a portion of audio signals 130 (used byaudio detector 104 for initial detection). The stored portion may be used by at least one subsequent processing stage (such asDSP 110 or a later processing stage of audio detector 104). If the subsequent stage powers up quickly, the amount of storage may be small enough to be both power and cost efficient. For example, if the subsequent stage powers up in 10 ms, then 160 samples of storage may be used to store an 8 kHzaudio signal 130. - Because audio signals 130 may be available to subsequent stage(s) (via storage device 122), at least one of the earlier processing stages may not need to be extremely selective (i.e., have a high detection accuracy). For example, a moderate false positive detection rate (e.g., by audio detector 104) may be filtered out at a later stage (such as by DSP 110).
- The storage of
audio signals 130 may also, for example, allow later stage(s) to distinguish between multiple detection triggers while simultaneously allowing earlier stage(s) not to distinguish between these triggers. For example, an early stage (such as audio detector 104) may identify that voice was detected and a later stage (such as DSP 110) may examine the same data to determine that a particular word was spoken. -
Device 100 may include one or more ofoptional transmitters 124 which convert signals into a format appropriate for transmission fromoptional antenna 128 oroptional receivers 126 which convert radio signals into a suitable format received fromoptional antenna 128. -
Device 100 may include other functional components (not shown), such as a power supply, an amplifier and/or a filter. These components may also have different operating characteristics when in the low power mode compared with the normal power mode. For example, amplifiers could be run in a lower current consumption mode in the low power mode. According to another example, clock references may have laxer tolerances in the low power mode (for example, an R-C clock might be sufficient in the low power mode, so that the crystals may be powered down). Examples of these techniques are described in U.S. Patent App. Pub. No. US 2011/0065413 to Singer. - Referring to
FIG. 1B , a functional block diagram of anexample device 100′ is shown, according to another embodiment of the present invention.Device 100′ is similar to device 100 (FIG. 1A ), except thataudio detector 104 indevice 100′ is clocked byclock signal 142 of auxiliaryclock signal generator 140. Thus, indevice 100′,audio detector 104 may be clocked separately from the rest of components ofdevice 100′.Audio detector 104 may also be powered independently of the other components ofdevice 100′. Thus,audio detector 104 may reduce the processing power required by, and thus current consumed by, other components ofdevice 100′. - Referring to
FIGS. 1A and 1B , it is understood that components of one or more ofaudio detector 104,general processor 106,power controller 112,clock signal generator 114 and auxiliaryclock signal generator 140 may be implemented in hardware or a combination of hardware and software. Althoughmicrophone 102,audio detector 104,general processor 106,DSP 110,power controller 112,clock signal generator 114,storage device 122,optional transmitter 124,optional receiver 126,optional antenna 118 and auxiliaryclock signal generator 140 are illustrated as part of one system (for example, formed on a single chip), various components of device 100 (anddevice 100′) may be formed separately. - It may be appreciated that hardware and/or software components of
devices - For example, if aspects of the present invention are implemented on existing hardware which already includes a low power (i.e., low clock rate) microprocessor (i.e., general processor 106), additional components (such as
audio detector 104 and power controller 112) may have to be added (such as from discrete components) to the hardware. This may increase the number of components and a required area of a printed circuit board (PCB). - In contrast, if aspects of the present invention are implemented as part of a new application-specific integrated circuit (ASIC), an increase in cost for adding some analog processing components, for example, may be marginal. These analog components, for example, may provide some simple processing (such as a noise gate) at lower power consumption than processing by a microprocessor. As another example, the analog components may occupy a smaller chip area than the chip area used to support extra ROM and/or RAM to extend the microprocessor's program and storage (to perform the audio detection processing).
- Similarly, an ADC may consume a substantial amount of power. A noise gate implemented in a microprocessor on an existing system may also require continual use of an ADC. In contrast, a noise gate implemented with analog components may allow the ADC to be switched off until the input is determined to be sufficiently interesting (i.e., above a threshold).
- Referring next to
FIG. 2 , a functional block diagram ofaudio detector 104 is shown.Audio detector 104 may includecomparator 208.Audio detector 104 may also include one or more optional components such as analog to digital converter (ADC) 202, filter 204 (also referred to herein as filter(s) 204) and/orlevel trigger 206. - According to an exemplary embodiment,
comparator 208 may receiveaudio signals 130 and may generatedetection signal 132. In general,comparator 208 may compareaudio signals 130 to a predetermined audio signal 214 (also referred to herein as predetermined audio signal(s) 214) to generatedetection signal 132. For example,comparator 208 may compare frequency components ofaudio signals 130 with predetermined audio signal(s) 214, to detect the probable presence of predetermined audio signal(s) 214.Comparator 208 is described further below with respect toFIG. 3 . - As discussed above,
audio signals 130 may include an analog signal or a digital signal. Thus,comparator 208 may be configured to processaudio signals 130 in the analog domain and/or in the digital domain. - Although a
single comparator 208 is shown inFIG. 2 ,audio detector 104 may include two ormore comparators 208. According to an example embodiment, eachcomparator 208 may provide different detection accuracy. According to another example embodiment, eachcomparator 208 may provide different levels of comparison. Examples of comparison may include: whether the audio signal contains voice signals compared to non-voice signals; whether the audio contains a user's voice (or one of a set of users' voices) compared to other voices; or whether the audio contains specific keywords compared to other noises produced by the user. As discussed above, predetermined audio signal(s) 214 may also include predetermined non-voice signals, such as, without being limited to, a whistle, a clap or a click. -
Audio detector 104 may includeoptional ADC 202.Optional ADC 202 may receiveaudio signals 130 as an analog signal, and may convertaudio signals 130 to a digital signal.ADC 202 may provide a digital signal to comparator 208 (or to optional filter(s) 204 or to optional level trigger 206). In an example embodiment, in the low power mode,ADC 202 may operate with a lower accuracy clock (such as usingsecond clock 120 shown inFIG. 1A ) or at a lower frequency than during the normal power mode. -
Audio detector 104 may include optional filter(s) 204. Filter(s) 204 may receive audio signals 130 (or a digitized signal from optional ADC 202) and provide a filtered signal to comparator 208 (or to optional level trigger 206). Optional filter(s) 204 may be configured with filter parameter(s) 210. Optional filter(s) 204 may include any suitable analog domain or frequency domain filters, such as, low pass filters, high pass filters, band pass filters, notch filters, or any combination thereof. - According to an example embodiment, optional filter(s) 204 may include a high pass filter, to attenuate a direct current (DC) component, for reducing false positive audio detection. According to another example embodiment, optional filter(s) 204 may include a band pass filter to pass a range of frequencies corresponding to voice (for example, between about 50 Hz and about 4 kHz).
-
Audio detector 104 may includeoptional level trigger 206.Optional level trigger 206 may receive audio signals 130 (or a digitized signal fromoptional ADC 202 or a filtered signal from optional filter(s) 204) and may provide a trigger signal tocomparator 208.Optional level trigger 206 may compare a level ofaudio signals 130 to optionalnoise gate threshold 212. If the level ofaudio signals 130 is greater than optionalnoise gate threshold 212,optional level trigger 206 may triggercomparator 208 to analyzeaudio signals 130. Otherwise,comparator 208 may not analyzeaudio signals 130. Thus,optional level trigger 206 may operate as a noise gate. - According to an example embodiment,
optional level trigger 206 may receive the analog signal and generate a noise-gated signal. The noise-gated signal may be provided tocomparator 208 for analysis. Thus,comparator 208 may be able to obtain, effectively a one bit per sample audio signal for processing. - As discussed above with respect to
FIG. 1A ,device 100 may includestorage device 122, which may store at least a portion of audio signals 130. Storage ofaudio signals 130 may be controlled during different stages ofaudio detector 104. For example, storage may be non-volatile and may not be active unlessoptional level trigger 206 provides a trigger signal tocomparator 208. This could allow storage device 122 (FIG. 1A ) to be powered off for the majority of the lifetime of device 100 (in the low power mode). - According to an example embodiment,
audio detector 104 may include a microprocessor, which may perform the processing during the low power mode (with low power components). It may be desirable to runaudio detector 104 independently from general processor 106 (FIG. 1A ) of device. In the low power mode, general processor 106 (FIG. 1A ) may be configured into a low leakage current state, by placing its RAMs into a low voltage data retention state. In this state, the RAMs of general processor 106 (FIG. 1A ) may not be accessed. Accordingly, audio detector 104 (e.g., a microprocessor) may include RAM (not shown) separate from the RAM of general processor 106 (FIG. 1A ). In some cases, general processor 106 (FIG. 1A ) may be powered off completely (losing its RAM contents but saving power). General processor 106 (FIG. 1A ) may also include non-volatile RAM (NVRAM) to retain its contents when powered off. - According to an example
embodiment audio detector 104 may be formed from passive components. According to another example embodiment, one or more components of audio detector may be adjusted. For example, at least one component may be adjusted (adapted) responsive to changes in environmental noise conditions. According to another example embodiment, one or more components of audio detector may be trained to detect predetermined audio signal(s) 214 under various noise conditions. According to a further exemplary embodiment, one or more components of audio detector may be capable of learning new predetermined audio signal(s) 214 and/or new noise conditions. - Adjustment of at least one of optional filter parameter(s) 210, optional
noise gate threshold 212, predetermined audio signal(s) 214 andcomparator 208 is generally indicated by respective optional control signals 216-1, 216-2, 216-3 and 216-4. Control signals 216 may be provided, for example, by general processor 106 (FIG. 1A ). - For example, during training,
audio detector 104 may attempt to find filter bank parameters 312 (FIG. 3 ) of comparator 208 (via control signal 216-4) that identify different parts of a keyword with good selectivity. To cope with environmental noise, audio detector 104 (via control signal 216-1) may alter optional filter parameter(s) 210 away from ideal settings for a noise-free environment to reduce noise degradation of audio signals 130. As another example, audio detector 104 (via control signal 216-2) may alter optionalnoise gate threshold 212 away from ideal settings for the noise free environment to reduce false positive triggering byoptional level trigger 206. - The adaptability of
audio detector 104 may be selected to target a particular ratio of wake-ups (i.e., switching to the normal power mode) being, true positives or a particular minimum wake-up rate when using non-ideal settings (e.g., for noisy environments). - According to an example embodiment,
audio detector 104 may be adapted to react to false positives. According to another example embodiment,audio detector 104 may be adapted to compensate for false positives and false negatives. For example,audio detector 104 may alter thresholds and/or other parameters to reduce false positives. Over time, unfortunately,audio detector 104 may reduce the number of false positives while gradually becoming less sensitive to the true positives. With a multi-stage audio detector, if the first stage rejects too many signals, there may be no way to identify false negatives without user interaction. However, if the first stage (such asoptional level trigger 206 or one stage of comparator 208) allows some false positives through, later stages can use these false positives to ensure thataudio detector 104 does not become insensitive to true positives.Audio detector 104 may also allow some target levels of false positives to ensure no or few false negatives. - According to an example embodiment, for environmental adaptation, one or more components of audio detector 104 (or of
device 100 ofFIG. 1A ) may wake up periodically to sample the background noise and/or to adjust filter parameters or other parameters ofaudio detector 104. For example,device 100 may determine the background noise level and adjustnoise gate threshold 212 to be just above the background noise level, effectively generating a rolling average estimate of the current background noise level. - Although periodic wake up of components of device 100 (
FIG. 1A ) may be expensive in terms of power, it may be possible to suppress the wake up when it is known that the environment is quiet. For example, at night the user may typically leavedevice 100 in a quiet area.Device 100 may setnoise gate threshold 212 to a relatively low value and turn off periodic environmental noise adaptation.Device 100 may, thus, be confident that any change in the environment may causeoptional level trigger 206 to provide a trigger signal for initial audio detection. - In the above example, it may be appreciated that
audio detector 104 may wake up the full device 100 (FIG. 1A ) in response to a user's trigger; and may also wake up thefull device 100 in response to change in environment. This double triggering may be generalized. In some cases, particularly with constant or near-constant environments (such as driving) the high power mode components ofdevice 100 may teach the low power mode components to wake it up either for a trigger or for a change in the environment. - Adaptability of
audio detector 104 may be assisted by storing of audio signals 130 (such as instorage device 122 ofFIG. 1A ) during operation in the low power mode. This may allow the full device 100 (FIG. 1A ), in the normal power mode, to determine the exact signal that caused triggering of audio detector 104 (in the low power mode). For example, this signal may be applied to a model of the low power circuit with varying parameters to determine new parameters foraudio detector 104. - According to an example embodiment, parameters of
audio detector 104 may be kept constant when device 100 (FIG. 1A ) is in the low power mode. If adaptation is desired,device 100 may be brought into the normal power mode. Device 100 (FIG. 1A ) (in the normal power mode) may then determine new parameters, load them intoaudio detector 104 and return to the low power mode. - According to another example embodiment, sufficiently sophisticated components of
audio detector 104 may be capable of being adapted while remaining in the low power mode (i.e., without switching to the normal power mode as described above). For example,audio detector 104 may be able to adapt an initialnoise gate threshold 212 while remaining in the low power mode but may switch to the normal power mode to identify a persistent background noise and calculate settings for components ofaudio detector 104 that may suppress the background noise. -
Audio detector 104 may be capable of being adapted according to other techniques. For example,audio detector 104 may examine a new portion ofaudio signals 130 aftercomparator 208 is triggered byoptional level trigger 206, to adjust parameters ofaudio detector 104. - For example, device 100 (
FIG. 1A ) may assume that the new portion ofaudio signals 130 is similar to the signal that caused triggering oflevel trigger 206. Storage device 122 (FIG. 1A ) may be configured to store 10 ms of audio. This amount of audio may be of sufficient length between triggering bylevel trigger 206 until the next stage (comparator 208) is ready to process this audio. Accordingly,comparator 208 may expect a voice signal (for example) to follow the trigger. If the voice signal is not detected, audio detector may determine whetheraudio signals 130 are continuously above noise gate threshold 212 (i.e., whethernoise gate threshold 212 is producing false positives). If so,noise gate threshold 212 may be adjusted (or optional filter parameter(s) 210 may be adjusted). - In general, 10 ms of storage may not be of sufficient duration to store a whole keyword trigger. For an entire keyword, it may be desirable to store about 1 to 2 seconds of audio signals 130. In general, it may be desirable to store between about 10 ms to about 2 seconds of audio signals 130. More preferably, it may be desirable to store about 100 ms of
audio signals 130. For example, a 100 ms duration may be sufficient to detect that the user is speaking but not the specific word. A 100 ms duration may be long enough to identify a phoneme or, more specifically, that the user is probably speaking the first phoneme of a keyword. If device 100 (FIG. 1A ) records, for example, 8 bit samples at 4 kHz during that time, only 800 bytes of storage may be needed. With 1 kB of storage,device 100 may be able to increase sampling of any ADCs up to 16 bit samples at 16 kHz while a next stage gets ready for audio detection. - Referring next to
FIG. 3 , a functional block diagram ofcomparator 208 is shown.Comparator 208 may includefilter bank 302, wideband signal detector 304,narrowband signal detector 306,storage device 308 andpattern comparator 310. -
Filter bank 302 may receiveaudio signals 130 and may apply a plurality of filters toaudio signals 130, according to one or more filter bank parameters 312 (referred to herein as filter bank parameter(s) 312).Filter bank 302 may include any suitable analog domain or frequency domain filters, such as, low pass filters, high pass filters, band pass filters, notch filters, or any combination thereof. - For example,
filter bank 302 may filteraudio signals 130 into three frequency bands, such as a low frequency band, a mid-frequency band and a high frequency band corresponding to frequencies associated with a user's voice (e.g., audio of interest). In general, filter bank parameter(s) 312 offilter bank 302 may represent frequencies indicative of a probable presence of predetermined audio signal(s) 214 in audio signals 130. - Filter bank parameter(s) 312 may represent filter parameters for filter banks corresponding to a number of different predetermined audio signals 214. Selection of filter bank parameter(s) 312 may be controlled, for example, by control signal 314-1. Thus,
filter bank 302 may be adjusted to detect a number of different predetermined audio signals 214 (such as a number of different voices). - A plurality of filtered signals from
filter bank 302 may be provided to wideband signal detector 304 andnarrowband signal detector 306. Wideband detector 304 may analyze a variation in the filtered signals over a wide range of frequencies whereasnarrowband detector 306 may analyze a variation in the filtered signals over a narrow range of frequencies. Eachdetector 304, 306 may compare the analyzed signals to a respective (wideband or narrowband) detection threshold. If the analyzed signals are greater than the respective detection threshold, the corresponding detector may output a respective detection indication. - For example, voice may contain a mixture of consonants and vowels. Vowels are typically a narrow bandwidth signal (a small range of frequencies), whereas consonants are a wide bandwidth signal (a large range of frequencies). Each
detector 304, 306 may simultaneously perform the respective analysis over time. Accordingly, over time, the outputs ofdetectors 304 and 306 may indicate a pattern of wideband and narrowband signals. - The detection thresholds and other parameters of wideband signal detector 304 and
narrowband signal detector 306 may be adjusted, for example, by respective control signals 314-2 and 314-3. Forexample detectors 304 and 306 may be adjusted to correspond to a number of different predetermined audio signals 214. - Although wideband signal detector 304 and
narrowband signal detector 306 are shown inFIG. 3 , in general, any suitable number of detectors may be used to detect a variation over time in the filtered signals (from filter bank 302) over one or more frequency bands. For example, a number ofnarrowband signal detectors 306 may analyze a variation in the power in different frequency bands over time. - In general,
detectors 304 and 306 may perform the frequency analysis using any suitable technique, such as, without being limited to, a fast Fourier transform (FFT) in the frequency domain, or techniques in the analog domain. Variations in specific frequencies may be used to identify whether it is likely that predetermined audio signal(s) 214 is inaudio signals 130. -
Storage device 308 may receive and store the detection results fromdetectors 304 and 306 over a period of time, as a detected pattern.Storage device 308 may include, for example, a shift register, a random access memory (RAM), a magnetic disk, an optical disk, flash memory or a hard drive. -
Pattern comparator 310 may receive the detected pattern stored instorage device 308. The detected pattern may be compared to predetermined audio signal(s) 214. If the detected pattern is substantially similar to predetermined audio signal(s) 214,pattern comparator 310 may indicate the detected presence of predeterminedaudio signal 214, bydetection signal 132. - For example,
pattern comparator 310 may analyze a mix of wideband and narrowband signals (from the detected pattern) at time intervals consistent with predetermined spoken words. It is understood that careful choice of keywords (such as multi-syllable keywords) to wake-up device 100 (FIG. 1A ) may improve the audio detection accuracy. - Parameters of
pattern comparator 310 may be adjusted, for example, by control signal 314-4. For example, a detection accuracy ofpattern comparator 310 may be adjusted. - As discussed above with respect to
FIG. 2 , one or more components ofcomparator 208 may be adjusted, for example, responsive to changes in environmental noise conditions. According to another example embodiment, one or more components ofcomparator 208 may be trained to detect predetermined audio signal(s) 214 under various noise conditions. According to a further exemplary embodiment, one or more components ofcomparator 208 may be capable of learning new predetermined audio signal(s) 214 and/or new noise conditions. Adjustment ofcomparator 208 is generally indicated by respective optional control signals 314-1, 314-2, 314-3 and 314-4. Control signals 314 may be provided, for example, by general processor 106 (FIG. 1A ). - For example, audio detector 104 (
FIG. 2 ) may be configured to learn new keywords. A user may be asked to repeat a new keyword so thataudio detector 104 can learn and store the new keyword. Repeated unsuccessful attempts to learn the new keyword may cause comparator 208 (and/or other optional components of audio detector 104) to adjust one or more of its parameters. - Referring next to
FIG. 4 , a flowchart diagram of an example method of detecting a predetermined audio signal is shown. Atstep 400,. device 100 (FIG. 1A ) is maintained in a low power mode. For example, power controller 112 (FIG. 1A ) may controlclock signal generator 114 to use second clock 120 (a lower accuracy clock) to provideclock signal 136 to components ofdevice 100, includinggeneral processor 106. - At
optional step 402,audio signals 130 may be filtered, for example, by at least onefilter 204 of audio detector 104 (FIG. 2 ). Atoptional step 404, a level ofaudio signals 130 may be determined, for example, bylevel trigger 206 of audio detector 104 (FIG. 2 ). Atoptional step 406, it is determined whether the level ofaudio signals 130 is greater thannoise gate threshold 212, for example, bylevel trigger 206 of audio detector 104 (FIG. 2 ). - If it is determined, at
optional step 406, that the level ofaudio signals 130 is greater thannoise gate threshold 212,optional step 406 may proceed tooptional step 408. Atoptional step 408, one or more additional components of audio detector 104 (FIG. 2 ) may be powered up. For example,audio detector 104 may power up comparator 208 (FIG. 2 ).Optional step 408 may proceed to step 410. - If it is determined, at
optional step 406, that the level ofaudio signals 130 is less than or equal tonoise gate threshold 212,optional step 406 may proceed to step 400. One or more of optional steps 402-408 may be repeated. - At
step 410,audio signals 130 are analyzed to detect a probable presence of apredetermined audio signal 214 inaudio signals 130, for example, bycomparator 208 of audio detector 104 (FIG. 2 ). Atstep 412, it is determined whether the presence of predeterminedaudio signal 214 is detected, for example, bycomparator 208 of audio detector 104 (FIG. 2 ). - If it is determined, at
step 412, that thepredetermined audio signal 214 is detected,step 412 may proceed tooptional step 414. Atoptional step 414,DSP 110 of device 100 (FIG. 1A ) may be powered up.DSP 110 may be powered up and operated at a reduced clock rate, such as bysecond clock 120 of clock signal generator 114 (FIG. 1A ).Optional step 414 may proceed tooptional step 416. According to another example embodiment, upon detection of predetermined audio signal 214 (step 412), audio signals 130 may be stored (for example, in storage device 122 (FIG. 1A )) or predeterminedaudio signal 214 may be repeated by the user (to confirm that predeterminedaudio signal 214 was indeed indicated). - If it is determined, at
step 412, thatpredetermined audio signal 214 is not detected,step 412 may proceed to step 400. - At
optional step 416,audio signals 130 are analyzed to detect the probable presence of predeterminedaudio signal 214 inaudio signals 130, for example, byDSP 110 at a reduced clock rate (FIG. 1A ). Atoptional step 418, it is determined whether predeterminedaudio signal 214 is detected, for example, byDSP 110 of device 100 (FIG. 1A ) - If it is determined, at
optional step 418, thatpredetermined audio signal 214 is detected,optional step 418 may proceed tooptional step 420. Atoptional step 420,DSP 110 of device 100 (FIG. 1A ) may be powered up and operated at a higher clock rate, such as byfirst clock 118 ofclock signal generator 114.Optional step 420 may proceed tooptional step 422. - If it is determined, at
optional step 418, thatpredetermined audio signal 214 is not detected,optional step 418 may proceed to step 400. - At
optional step 422,audio signals 130 are analyzed to detect the probable presence of predeterminedaudio signal 214 inaudio signals 130, for example, byDSP 110 at the higher clock rate (FIG. 1A ). Atoptional step 424, it is determined whether predeterminedaudio signal 214 is detected, for example, byDSP 110 of device 100 (FIG. 1A ). - If it is determined, at
optional step 424, thatpredetermined audio signal 214 is detected,optional step 424 may proceed to step 426. - At
step 426,device 100 may be switched to the normal power mode. For example, power controller 112 (FIG. 1A ) may controlclock signal generator 114 to use first clock 118 (a higher accuracy clock) to provideclock signal 136 to components ofdevice 100, includinggeneral processor 106. - If it is determined, at
optional step 424, thatpredetermined audio signal 214 is not detected,optional step 424 may proceed to step 400. - Steps 400-424 may be continuously or periodically repeated until predetermined
audio signal 214 is detected. In general, steps 410-412 (more advanced audio processing capability) combined with optional steps 402-408 (reduced audio processing capability) and/or optional steps 414-424 (most advanced audio processing capability, such as voice recognition processing with HMMs) may be used to trade-off power consumption against audio processing capability. - Although the invention has been described in terms of devices and methods of detecting the probable presence of a predetermined audio signal, it is contemplated that one or more products may be implemented in software on microprocessors/general purpose computers (not shown). In this embodiment, one or more of the functions of the various components may be implemented in software that controls a general purpose computer. This software may be embodied in a non-transitory computer readable medium, for example, RAM, a magnetic or optical disk or a memory-card.
- Although the invention is illustrated and described herein with reference to specific embodiments, the invention is not intended to be limited to the details shown. Rather, various modifications may be made in the details within the scope and range of equivalents of the claims and without departing from the invention.
Claims (27)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/776,882 US9838810B2 (en) | 2012-02-27 | 2013-02-26 | Low power audio detection |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201261603717P | 2012-02-27 | 2012-02-27 | |
US13/776,882 US9838810B2 (en) | 2012-02-27 | 2013-02-26 | Low power audio detection |
Publications (2)
Publication Number | Publication Date |
---|---|
US20130223635A1 true US20130223635A1 (en) | 2013-08-29 |
US9838810B2 US9838810B2 (en) | 2017-12-05 |
Family
ID=48092201
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/776,882 Expired - Fee Related US9838810B2 (en) | 2012-02-27 | 2013-02-26 | Low power audio detection |
Country Status (3)
Country | Link |
---|---|
US (1) | US9838810B2 (en) |
DE (1) | DE102013003273A1 (en) |
GB (1) | GB2501367B (en) |
Cited By (78)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140244269A1 (en) * | 2013-02-28 | 2014-08-28 | Sony Mobile Communications Ab | Device and method for activating with voice input |
US20140257821A1 (en) * | 2013-03-07 | 2014-09-11 | Analog Devices Technology | System and method for processor wake-up based on sensor data |
US8873770B2 (en) * | 2012-10-11 | 2014-10-28 | Cochlear Limited | Audio processing pipeline for auditory prosthesis having a common, and two or more stimulator-specific, frequency-analysis stages |
US20140343949A1 (en) * | 2013-05-17 | 2014-11-20 | Fortemedia, Inc. | Smart microphone device |
US20150006181A1 (en) * | 2013-06-28 | 2015-01-01 | Kopin Corporation | Digital Voice Processing Method and System for Headset Computer |
US20150160707A1 (en) * | 2013-12-06 | 2015-06-11 | Htc Corporation | Portable electronic device |
US20150170645A1 (en) * | 2013-12-13 | 2015-06-18 | Harman International Industries, Inc. | Name-sensitive listening device |
WO2015094369A1 (en) * | 2013-12-20 | 2015-06-25 | Intel Corporation | Transition from low power always listening mode to high power speech recognition mode |
US20150187369A1 (en) * | 2013-12-28 | 2015-07-02 | Saurabh Dadu | Intelligent ancillary electronic device |
US20150222987A1 (en) * | 2014-02-06 | 2015-08-06 | Sol Republic Inc. | Methods for operating audio speaker systems |
US20150269954A1 (en) * | 2014-03-21 | 2015-09-24 | Joseph F. Ryan | Adaptive microphone sampling rate techniques |
WO2016007425A1 (en) * | 2014-07-10 | 2016-01-14 | Google Inc. | Automatically activated visual indicators on computing device |
CN105307097A (en) * | 2015-10-09 | 2016-02-03 | 湖南康通电子科技有限公司 | Online monitoring method for audio power amplifier and system thereof |
CN105632491A (en) * | 2014-11-26 | 2016-06-01 | 三星电子株式会社 | Method and electronic device for voice recognition |
US20160170467A1 (en) * | 2014-12-16 | 2016-06-16 | Stmicroelectronics (Rousset) Sas | Electronic Device Comprising a Wake Up Module Distinct From a Core Domain |
US20160240194A1 (en) * | 2015-02-16 | 2016-08-18 | Samsung Electronics Co., Ltd. | Electronic device and method of operating voice recognition |
US20160240193A1 (en) * | 2015-02-12 | 2016-08-18 | Apple Inc. | Clock Switching in Always-On Component |
KR20160113255A (en) * | 2014-03-31 | 2016-09-28 | 인텔 코포레이션 | Location aware power management scheme for always-on-always-listen voice recognition system |
US9467785B2 (en) | 2013-03-28 | 2016-10-11 | Knowles Electronics, Llc | MEMS apparatus with increased back volume |
US20160302029A1 (en) * | 2014-07-02 | 2016-10-13 | Sonetics Holdings, Inc. | Multiple communication mode headset |
US9478234B1 (en) | 2015-07-13 | 2016-10-25 | Knowles Electronics, Llc | Microphone apparatus and method with catch-up buffer |
US9502028B2 (en) | 2013-10-18 | 2016-11-22 | Knowles Electronics, Llc | Acoustic activity detection apparatus and method |
US9503814B2 (en) | 2013-04-10 | 2016-11-22 | Knowles Electronics, Llc | Differential outputs in multiple motor MEMS devices |
EP3096534A1 (en) * | 2015-05-22 | 2016-11-23 | Nxp B.V. | Microphone control for power saving |
US20170084292A1 (en) * | 2015-09-23 | 2017-03-23 | Samsung Electronics Co., Ltd. | Electronic device and method capable of voice recognition |
US9633655B1 (en) | 2013-05-23 | 2017-04-25 | Knowles Electronics, Llc | Voice sensing and keyword analysis |
US9668051B2 (en) | 2013-09-04 | 2017-05-30 | Knowles Electronics, Llc | Slew rate control apparatus for digital microphones |
WO2017090035A1 (en) * | 2015-11-23 | 2017-06-01 | Essence Smartcare Ltd. | Analog and digital microphone |
US9712923B2 (en) | 2013-05-23 | 2017-07-18 | Knowles Electronics, Llc | VAD detection microphone and method of operating the same |
US9712915B2 (en) | 2014-11-25 | 2017-07-18 | Knowles Electronics, Llc | Reference microphone for non-linear and time variant echo cancellation |
US9711166B2 (en) | 2013-05-23 | 2017-07-18 | Knowles Electronics, Llc | Decimation synchronization in a microphone |
EP3073653A4 (en) * | 2013-11-20 | 2017-07-26 | Soundlly Inc. | Low-power sound wave receiving method and mobile device using same |
KR101770932B1 (en) * | 2013-11-12 | 2017-08-23 | 애플 인크. | Always-on audio control for mobile device |
WO2017151650A1 (en) | 2016-02-29 | 2017-09-08 | Littrell Robert J | A piezoelectric mems device for producing a signal indicative of detection of an acoustic stimulus |
US9831844B2 (en) | 2014-09-19 | 2017-11-28 | Knowles Electronics, Llc | Digital microphone with adjustable gain control |
US9830913B2 (en) | 2013-10-29 | 2017-11-28 | Knowles Electronics, Llc | VAD detection apparatus and method of operation the same |
US9830080B2 (en) | 2015-01-21 | 2017-11-28 | Knowles Electronics, Llc | Low power voice trigger for acoustic apparatus and method |
US9866938B2 (en) | 2015-02-19 | 2018-01-09 | Knowles Electronics, Llc | Interface for microphone-to-microphone communications |
US9883270B2 (en) | 2015-05-14 | 2018-01-30 | Knowles Electronics, Llc | Microphone with coined area |
US9894437B2 (en) | 2016-02-09 | 2018-02-13 | Knowles Electronics, Llc | Microphone assembly with pulse density modulated signal |
US10020008B2 (en) | 2013-05-23 | 2018-07-10 | Knowles Electronics, Llc | Microphone and corresponding digital interface |
US10028054B2 (en) | 2013-10-21 | 2018-07-17 | Knowles Electronics, Llc | Apparatus and method for frequency detection |
US10045104B2 (en) | 2015-08-24 | 2018-08-07 | Knowles Electronics, Llc | Audio calibration using a microphone |
US10121472B2 (en) | 2015-02-13 | 2018-11-06 | Knowles Electronics, Llc | Audio buffer catch-up apparatus and method with two microphones |
US10257616B2 (en) | 2016-07-22 | 2019-04-09 | Knowles Electronics, Llc | Digital microphone assembly with improved frequency response and noise characteristics |
US10291973B2 (en) | 2015-05-14 | 2019-05-14 | Knowles Electronics, Llc | Sensor device with ingress protection |
CN110220589A (en) * | 2019-06-20 | 2019-09-10 | 华电重工股份有限公司 | A kind of noise on-line measuring device and system |
CN110390957A (en) * | 2018-04-19 | 2019-10-29 | 半导体组件工业公司 | Method and apparatus for speech detection |
US10469967B2 (en) | 2015-01-07 | 2019-11-05 | Knowler Electronics, LLC | Utilizing digital microphones for low power keyword detection and noise suppression |
US10499150B2 (en) | 2016-07-05 | 2019-12-03 | Knowles Electronics, Llc | Microphone assembly with digital feedback loop |
US10575117B2 (en) | 2014-12-08 | 2020-02-25 | Harman International Industries, Incorporated | Directional sound modification |
US10628172B2 (en) * | 2016-06-27 | 2020-04-21 | Qualcomm Incorporated | Systems and methods for using distributed universal serial bus (USB) host drivers |
US10678494B2 (en) | 2016-06-27 | 2020-06-09 | Qualcomm Incorporated | Controlling data streams in universal serial bus (USB) systems |
CN111837179A (en) * | 2018-03-12 | 2020-10-27 | 赛普拉斯半导体公司 | System and method for capturing noise for pattern recognition processing |
CN112073862A (en) * | 2019-06-10 | 2020-12-11 | 美商楼氏电子有限公司 | Audible keyword detection and method |
US10908880B2 (en) | 2018-10-19 | 2021-02-02 | Knowles Electronics, Llc | Audio signal circuit with in-place bit-reversal |
CN112423195A (en) * | 2020-11-03 | 2021-02-26 | 广东公信智能会议股份有限公司 | Self-checking device and method for audio transmission |
US10979824B2 (en) | 2016-10-28 | 2021-04-13 | Knowles Electronics, Llc | Transducer assemblies and methods |
US11025356B2 (en) | 2017-09-08 | 2021-06-01 | Knowles Electronics, Llc | Clock synchronization in a master-slave communication system |
US11061642B2 (en) | 2017-09-29 | 2021-07-13 | Knowles Electronics, Llc | Multi-core audio processor with flexible memory allocation |
US11120804B2 (en) | 2019-04-01 | 2021-09-14 | Google Llc | Adaptive management of casting requests and/or user inputs at a rechargeable device |
US11163521B2 (en) | 2016-12-30 | 2021-11-02 | Knowles Electronics, Llc | Microphone assembly with authentication |
US11172312B2 (en) | 2013-05-23 | 2021-11-09 | Knowles Electronics, Llc | Acoustic activity detecting microphone |
CN113726467A (en) * | 2021-07-29 | 2021-11-30 | 黎兴荣 | Electronic product data transmission method, system, storage medium and program product |
US11189262B2 (en) * | 2018-12-18 | 2021-11-30 | Baidu Online Network Technology (Beijing) Co., Ltd. | Method and apparatus for generating model |
WO2022047808A1 (en) * | 2020-09-07 | 2022-03-10 | 华为技术有限公司 | Image processing apparatus, electronic device, and image processing method |
WO2022151651A1 (en) * | 2021-01-13 | 2022-07-21 | 神盾股份有限公司 | Speech assistant system |
CN114828170A (en) * | 2021-01-27 | 2022-07-29 | 华为技术有限公司 | Method and device for improving endurance time of communication device |
US11418882B2 (en) * | 2019-03-14 | 2022-08-16 | Vesper Technologies Inc. | Piezoelectric MEMS device with an adaptive threshold for detection of an acoustic stimulus |
US11438682B2 (en) | 2018-09-11 | 2022-09-06 | Knowles Electronics, Llc | Digital microphone with reduced processing noise |
WO2022193056A1 (en) * | 2021-03-15 | 2022-09-22 | 华为技术有限公司 | Media processing device and method |
US11617048B2 (en) | 2019-03-14 | 2023-03-28 | Qualcomm Incorporated | Microphone having a digital output determined at different power consumption levels |
US20230215450A1 (en) * | 2022-01-06 | 2023-07-06 | Tymphany Worldwide Enterprises Limited | Automatic noise gating |
US11726105B2 (en) | 2019-06-26 | 2023-08-15 | Qualcomm Incorporated | Piezoelectric accelerometer with wake function |
US11930330B2 (en) | 2018-12-21 | 2024-03-12 | Minuendo As | System for monitoring sound |
US20240129370A1 (en) * | 2021-03-03 | 2024-04-18 | Telefonaktiebolaget Lm Ericsson (Publ) | A computer software module arrangement, a circuitry arrangement, an arrangement and a method for an improved user interface for internet of things devices |
US12117320B2 (en) | 2014-05-29 | 2024-10-15 | Apple Inc. | System on a chip with always-on component with powered-down configurations to process audio samples |
WO2025019147A1 (en) * | 2023-07-16 | 2025-01-23 | Cirrus Logic International Semiconductor Ltd. | Clock signal transitioning for multi-mode audio processing systems |
Families Citing this family (49)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2541079B (en) * | 2013-06-26 | 2018-03-14 | Cirrus Logic Int Semiconductor Ltd | Analog-to-digital converter |
GB2553472B (en) * | 2013-06-26 | 2018-05-02 | Cirrus Logic Int Semiconductor Ltd | Analog-to-digital converter |
US9747899B2 (en) * | 2013-06-27 | 2017-08-29 | Amazon Technologies, Inc. | Detecting self-generated wake expressions |
DE102014216654A1 (en) * | 2014-08-21 | 2016-02-25 | Robert Bosch Gmbh | Signal processing circuit for a digital microphone |
US9799349B2 (en) * | 2015-04-24 | 2017-10-24 | Cirrus Logic, Inc. | Analog-to-digital converter (ADC) dynamic range enhancement for voice-activated systems |
US10097919B2 (en) | 2016-02-22 | 2018-10-09 | Sonos, Inc. | Music service selection |
US10095470B2 (en) | 2016-02-22 | 2018-10-09 | Sonos, Inc. | Audio response playback |
US10509626B2 (en) | 2016-02-22 | 2019-12-17 | Sonos, Inc | Handling of loss of pairing between networked devices |
US10264030B2 (en) | 2016-02-22 | 2019-04-16 | Sonos, Inc. | Networked microphone device control |
US9978390B2 (en) | 2016-06-09 | 2018-05-22 | Sonos, Inc. | Dynamic player selection for audio signal processing |
US10134399B2 (en) | 2016-07-15 | 2018-11-20 | Sonos, Inc. | Contextualization of voice inputs |
US10115400B2 (en) | 2016-08-05 | 2018-10-30 | Sonos, Inc. | Multiple voice services |
US9942678B1 (en) | 2016-09-27 | 2018-04-10 | Sonos, Inc. | Audio playback settings for voice interaction |
US10181323B2 (en) | 2016-10-19 | 2019-01-15 | Sonos, Inc. | Arbitration-based voice recognition |
US11183181B2 (en) | 2017-03-27 | 2021-11-23 | Sonos, Inc. | Systems and methods of multiple voice services |
US10475449B2 (en) | 2017-08-07 | 2019-11-12 | Sonos, Inc. | Wake-word detection suppression |
US10048930B1 (en) | 2017-09-08 | 2018-08-14 | Sonos, Inc. | Dynamic computation of system response volume |
US10446165B2 (en) | 2017-09-27 | 2019-10-15 | Sonos, Inc. | Robust short-time fourier transform acoustic echo cancellation during audio playback |
US10051366B1 (en) | 2017-09-28 | 2018-08-14 | Sonos, Inc. | Three-dimensional beam forming with a microphone array |
US10482868B2 (en) | 2017-09-28 | 2019-11-19 | Sonos, Inc. | Multi-channel acoustic echo cancellation |
US10466962B2 (en) | 2017-09-29 | 2019-11-05 | Sonos, Inc. | Media playback system with voice assistance |
US10880650B2 (en) | 2017-12-10 | 2020-12-29 | Sonos, Inc. | Network microphone devices with automatic do not disturb actuation capabilities |
US10818290B2 (en) | 2017-12-11 | 2020-10-27 | Sonos, Inc. | Home graph |
US11175880B2 (en) | 2018-05-10 | 2021-11-16 | Sonos, Inc. | Systems and methods for voice-assisted media content selection |
US10959029B2 (en) | 2018-05-25 | 2021-03-23 | Sonos, Inc. | Determining and adapting to changes in microphone performance of playback devices |
US11076035B2 (en) | 2018-08-28 | 2021-07-27 | Sonos, Inc. | Do not disturb feature for audio notifications |
US10587430B1 (en) | 2018-09-14 | 2020-03-10 | Sonos, Inc. | Networked devices, systems, and methods for associating playback devices based on sound codes |
US11024331B2 (en) | 2018-09-21 | 2021-06-01 | Sonos, Inc. | Voice detection optimization using sound metadata |
US10811015B2 (en) | 2018-09-25 | 2020-10-20 | Sonos, Inc. | Voice detection optimization based on selected voice assistant service |
US11100923B2 (en) | 2018-09-28 | 2021-08-24 | Sonos, Inc. | Systems and methods for selective wake word detection using neural network models |
US10692518B2 (en) | 2018-09-29 | 2020-06-23 | Sonos, Inc. | Linear filtering for noise-suppressed speech detection via multiple network microphone devices |
US11899519B2 (en) | 2018-10-23 | 2024-02-13 | Sonos, Inc. | Multiple stage network microphone device with reduced power consumption and processing load |
EP3654249A1 (en) | 2018-11-15 | 2020-05-20 | Snips | Dilated convolutions and gating for efficient keyword spotting |
US11183183B2 (en) | 2018-12-07 | 2021-11-23 | Sonos, Inc. | Systems and methods of operating media playback systems having multiple voice assistant services |
US11132989B2 (en) | 2018-12-13 | 2021-09-28 | Sonos, Inc. | Networked microphone devices, systems, and methods of localized arbitration |
US10602268B1 (en) | 2018-12-20 | 2020-03-24 | Sonos, Inc. | Optimization of network microphone devices using noise classification |
US10867604B2 (en) | 2019-02-08 | 2020-12-15 | Sonos, Inc. | Devices, systems, and methods for distributed voice processing |
US11120794B2 (en) | 2019-05-03 | 2021-09-14 | Sonos, Inc. | Voice assistant persistence across multiple network microphone devices |
US11138969B2 (en) | 2019-07-31 | 2021-10-05 | Sonos, Inc. | Locally distributed keyword detection |
US10871943B1 (en) | 2019-07-31 | 2020-12-22 | Sonos, Inc. | Noise classification for event detection |
US11189286B2 (en) | 2019-10-22 | 2021-11-30 | Sonos, Inc. | VAS toggle based on device orientation |
US11200900B2 (en) | 2019-12-20 | 2021-12-14 | Sonos, Inc. | Offline voice control |
US11562740B2 (en) | 2020-01-07 | 2023-01-24 | Sonos, Inc. | Voice verification for media playback |
US11556307B2 (en) | 2020-01-31 | 2023-01-17 | Sonos, Inc. | Local voice data processing |
US11308958B2 (en) | 2020-02-07 | 2022-04-19 | Sonos, Inc. | Localized wakeword verification |
US11482224B2 (en) | 2020-05-20 | 2022-10-25 | Sonos, Inc. | Command keywords with input detection windowing |
US11308962B2 (en) | 2020-05-20 | 2022-04-19 | Sonos, Inc. | Input detection windowing |
US11698771B2 (en) | 2020-08-25 | 2023-07-11 | Sonos, Inc. | Vocal guidance engines for playback devices |
US11984123B2 (en) | 2020-11-12 | 2024-05-14 | Sonos, Inc. | Network device interaction by range |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040131214A1 (en) * | 2002-08-21 | 2004-07-08 | Galler Bernard A. | Digital hearing aid battery conservation method and apparatus |
US20050141741A1 (en) * | 2003-12-30 | 2005-06-30 | Van Oerle Gerard | Method to optimize energy consumption in a hearing device as well as a hearing device |
US7418392B1 (en) * | 2003-09-25 | 2008-08-26 | Sensory, Inc. | System and method for controlling the operation of a device by voice commands |
US20080267416A1 (en) * | 2007-02-22 | 2008-10-30 | Personics Holdings Inc. | Method and Device for Sound Detection and Audio Control |
US20090110206A1 (en) * | 2003-01-09 | 2009-04-30 | Aerielle Technologies, Inc. | Method and apparatus for sensing a signal absence of audio and automatically entering low power mode |
US20110249836A1 (en) * | 2010-04-13 | 2011-10-13 | Starkey Laboratories, Inc. | Control of low power or standby modes of a hearing assistance device |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6070140A (en) | 1995-06-05 | 2000-05-30 | Tran; Bao Q. | Speech recognizer |
WO1997007437A1 (en) | 1995-08-21 | 1997-02-27 | Mathurin Trevor S | Audio controlled and activated wristwatch memory aid device |
JP2003202888A (en) | 2002-01-07 | 2003-07-18 | Toshiba Corp | Headset with radio communication function and voice processing system using the same |
GB2391673B (en) | 2002-08-06 | 2005-01-19 | Motorola Inc | Wireless audio monitor |
EP2137722A4 (en) | 2007-03-30 | 2014-06-25 | Savox Comm Oy Ab Ltd | A radio communication device |
US8498678B2 (en) | 2007-07-10 | 2013-07-30 | Texas Instruments Incorporated | System and method for reducing power consumption in a wireless device |
GB2456647B (en) | 2008-01-28 | 2012-09-12 | Cambridge Silicon Radio Ltd | Power-savings receiver |
GB2458499A (en) | 2008-03-20 | 2009-09-23 | Cambridge Silicon Radio Ltd | Sharing access to a data store by a host processor and a signal processor in a mobile phone |
US9112989B2 (en) | 2010-04-08 | 2015-08-18 | Qualcomm Incorporated | System and method of smart audio logging for mobile devices |
KR101192813B1 (en) | 2010-12-14 | 2012-10-26 | (주)이엔엠시스템 | Voice recognition system and method that performs voice recognition regarding low frequency domain sound in standby mode |
-
2013
- 2013-02-26 US US13/776,882 patent/US9838810B2/en not_active Expired - Fee Related
- 2013-02-27 DE DE102013003273A patent/DE102013003273A1/en not_active Withdrawn
- 2013-02-27 GB GB1303502.7A patent/GB2501367B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040131214A1 (en) * | 2002-08-21 | 2004-07-08 | Galler Bernard A. | Digital hearing aid battery conservation method and apparatus |
US20090110206A1 (en) * | 2003-01-09 | 2009-04-30 | Aerielle Technologies, Inc. | Method and apparatus for sensing a signal absence of audio and automatically entering low power mode |
US7418392B1 (en) * | 2003-09-25 | 2008-08-26 | Sensory, Inc. | System and method for controlling the operation of a device by voice commands |
US20050141741A1 (en) * | 2003-12-30 | 2005-06-30 | Van Oerle Gerard | Method to optimize energy consumption in a hearing device as well as a hearing device |
US20080267416A1 (en) * | 2007-02-22 | 2008-10-30 | Personics Holdings Inc. | Method and Device for Sound Detection and Audio Control |
US20110249836A1 (en) * | 2010-04-13 | 2011-10-13 | Starkey Laboratories, Inc. | Control of low power or standby modes of a hearing assistance device |
Cited By (158)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8873770B2 (en) * | 2012-10-11 | 2014-10-28 | Cochlear Limited | Audio processing pipeline for auditory prosthesis having a common, and two or more stimulator-specific, frequency-analysis stages |
US9949042B2 (en) | 2012-10-11 | 2018-04-17 | Cochlear Limited | Audio processing pipeline for auditory prosthesis having a common, and two or more stimulator-specific, frequency-analysis stages |
US10825457B2 (en) * | 2013-02-28 | 2020-11-03 | Sony Corporation | Device and method for activating with voice input |
US20190333509A1 (en) * | 2013-02-28 | 2019-10-31 | Sony Corporation | Device and method for activating with voice input |
US20210005201A1 (en) * | 2013-02-28 | 2021-01-07 | Sony Corporation | Device and method for activating with voice input |
US11580976B2 (en) * | 2013-02-28 | 2023-02-14 | Sony Corporation | Device and method for activating with voice input |
US20140244269A1 (en) * | 2013-02-28 | 2014-08-28 | Sony Mobile Communications Ab | Device and method for activating with voice input |
US10395651B2 (en) * | 2013-02-28 | 2019-08-27 | Sony Corporation | Device and method for activating with voice input |
US9349386B2 (en) * | 2013-03-07 | 2016-05-24 | Analog Device Global | System and method for processor wake-up based on sensor data |
US20140257821A1 (en) * | 2013-03-07 | 2014-09-11 | Analog Devices Technology | System and method for processor wake-up based on sensor data |
US9467785B2 (en) | 2013-03-28 | 2016-10-11 | Knowles Electronics, Llc | MEMS apparatus with increased back volume |
US9503814B2 (en) | 2013-04-10 | 2016-11-22 | Knowles Electronics, Llc | Differential outputs in multiple motor MEMS devices |
US20140343949A1 (en) * | 2013-05-17 | 2014-11-20 | Fortemedia, Inc. | Smart microphone device |
US9712923B2 (en) | 2013-05-23 | 2017-07-18 | Knowles Electronics, Llc | VAD detection microphone and method of operating the same |
US9711166B2 (en) | 2013-05-23 | 2017-07-18 | Knowles Electronics, Llc | Decimation synchronization in a microphone |
US10313796B2 (en) | 2013-05-23 | 2019-06-04 | Knowles Electronics, Llc | VAD detection microphone and method of operating the same |
US10332544B2 (en) | 2013-05-23 | 2019-06-25 | Knowles Electronics, Llc | Microphone and corresponding digital interface |
US10020008B2 (en) | 2013-05-23 | 2018-07-10 | Knowles Electronics, Llc | Microphone and corresponding digital interface |
US11172312B2 (en) | 2013-05-23 | 2021-11-09 | Knowles Electronics, Llc | Acoustic activity detecting microphone |
US9633655B1 (en) | 2013-05-23 | 2017-04-25 | Knowles Electronics, Llc | Voice sensing and keyword analysis |
US20150006181A1 (en) * | 2013-06-28 | 2015-01-01 | Kopin Corporation | Digital Voice Processing Method and System for Headset Computer |
US10070211B2 (en) * | 2013-06-28 | 2018-09-04 | Kopin Corporation | Digital voice processing method and system for headset computer |
US9668051B2 (en) | 2013-09-04 | 2017-05-30 | Knowles Electronics, Llc | Slew rate control apparatus for digital microphones |
US9502028B2 (en) | 2013-10-18 | 2016-11-22 | Knowles Electronics, Llc | Acoustic activity detection apparatus and method |
US10028054B2 (en) | 2013-10-21 | 2018-07-17 | Knowles Electronics, Llc | Apparatus and method for frequency detection |
US9830913B2 (en) | 2013-10-29 | 2017-11-28 | Knowles Electronics, Llc | VAD detection apparatus and method of operation the same |
US11049503B2 (en) | 2013-11-12 | 2021-06-29 | Apple Inc. | Always-on audio control for mobile device |
US12211506B2 (en) | 2013-11-12 | 2025-01-28 | Apple Inc. | Always-on audio control for mobile device |
US10431224B1 (en) | 2013-11-12 | 2019-10-01 | Apple Inc. | Always-on audio control for mobile device |
US10079019B2 (en) | 2013-11-12 | 2018-09-18 | Apple Inc. | Always-on audio control for mobile device |
KR101770932B1 (en) * | 2013-11-12 | 2017-08-23 | 애플 인크. | Always-on audio control for mobile device |
US10276165B2 (en) | 2013-11-12 | 2019-04-30 | Apple Inc. | Always-on audio control for mobile device |
US11862173B2 (en) | 2013-11-12 | 2024-01-02 | Apple Inc. | Always-on audio control for mobile device |
US10573319B2 (en) | 2013-11-12 | 2020-02-25 | Apple Inc. | Always-on audio control for mobile device |
US9953662B2 (en) | 2013-11-20 | 2018-04-24 | Soundlly Inc. | Low-power sound wave reception method and mobile device using the same |
EP3073653A4 (en) * | 2013-11-20 | 2017-07-26 | Soundlly Inc. | Low-power sound wave receiving method and mobile device using same |
US20150160707A1 (en) * | 2013-12-06 | 2015-06-11 | Htc Corporation | Portable electronic device |
US10720153B2 (en) * | 2013-12-13 | 2020-07-21 | Harman International Industries, Incorporated | Name-sensitive listening device |
US20150170645A1 (en) * | 2013-12-13 | 2015-06-18 | Harman International Industries, Inc. | Name-sensitive listening device |
CN105723451A (en) * | 2013-12-20 | 2016-06-29 | 英特尔公司 | Transition from low power always listening mode to high power speech recognition mode |
WO2015094369A1 (en) * | 2013-12-20 | 2015-06-25 | Intel Corporation | Transition from low power always listening mode to high power speech recognition mode |
US20150221307A1 (en) * | 2013-12-20 | 2015-08-06 | Saurin Shah | Transition from low power always listening mode to high power speech recognition mode |
EP3084760A4 (en) * | 2013-12-20 | 2017-08-16 | Intel Corporation | Transition from low power always listening mode to high power speech recognition mode |
US20150187369A1 (en) * | 2013-12-28 | 2015-07-02 | Saurabh Dadu | Intelligent ancillary electronic device |
US9460735B2 (en) * | 2013-12-28 | 2016-10-04 | Intel Corporation | Intelligent ancillary electronic device |
CN105793813A (en) * | 2013-12-28 | 2016-07-20 | 英特尔公司 | Intelligent ancillary electronic device |
US20150222987A1 (en) * | 2014-02-06 | 2015-08-06 | Sol Republic Inc. | Methods for operating audio speaker systems |
US9652532B2 (en) * | 2014-02-06 | 2017-05-16 | Sr Homedics, Llc | Methods for operating audio speaker systems |
US9406313B2 (en) * | 2014-03-21 | 2016-08-02 | Intel Corporation | Adaptive microphone sampling rate techniques |
US20150269954A1 (en) * | 2014-03-21 | 2015-09-24 | Joseph F. Ryan | Adaptive microphone sampling rate techniques |
KR20160113255A (en) * | 2014-03-31 | 2016-09-28 | 인텔 코포레이션 | Location aware power management scheme for always-on-always-listen voice recognition system |
KR102018152B1 (en) * | 2014-03-31 | 2019-09-04 | 인텔 코포레이션 | Location aware power management scheme for always-on-always-listen voice recognition system |
US20170031420A1 (en) * | 2014-03-31 | 2017-02-02 | Intel Corporation | Location aware power management scheme for always-on-always-listen voice recognition system |
CN106062661A (en) * | 2014-03-31 | 2016-10-26 | 英特尔公司 | Location aware power management scheme for always-on-always-listen voice recognition system |
EP3126929A4 (en) * | 2014-03-31 | 2017-11-22 | Intel Corporation | Location aware power management scheme for always-on- always-listen voice recognition system |
US10133332B2 (en) * | 2014-03-31 | 2018-11-20 | Intel Corporation | Location aware power management scheme for always-on-always-listen voice recognition system |
US12117320B2 (en) | 2014-05-29 | 2024-10-15 | Apple Inc. | System on a chip with always-on component with powered-down configurations to process audio samples |
US20160302029A1 (en) * | 2014-07-02 | 2016-10-13 | Sonetics Holdings, Inc. | Multiple communication mode headset |
US9998850B2 (en) * | 2014-07-02 | 2018-06-12 | Sonetyics Holdings, Inc. | Multiple communication mode headset |
US10235846B2 (en) | 2014-07-10 | 2019-03-19 | Google Llc | Automatically activated visual indicators on computing device |
WO2016007425A1 (en) * | 2014-07-10 | 2016-01-14 | Google Inc. | Automatically activated visual indicators on computing device |
US9881465B2 (en) | 2014-07-10 | 2018-01-30 | Google Llc | Automatically activated visual indicators on computing device |
US9831844B2 (en) | 2014-09-19 | 2017-11-28 | Knowles Electronics, Llc | Digital microphone with adjustable gain control |
US9712915B2 (en) | 2014-11-25 | 2017-07-18 | Knowles Electronics, Llc | Reference microphone for non-linear and time variant echo cancellation |
CN105632491A (en) * | 2014-11-26 | 2016-06-01 | 三星电子株式会社 | Method and electronic device for voice recognition |
US9779732B2 (en) | 2014-11-26 | 2017-10-03 | Samsung Electronics Co., Ltd | Method and electronic device for voice recognition |
WO2016085157A1 (en) * | 2014-11-26 | 2016-06-02 | Samsung Electronics Co., Ltd. | Method and electronic device for voice recognition |
US10575117B2 (en) | 2014-12-08 | 2020-02-25 | Harman International Industries, Incorporated | Directional sound modification |
CN105700660B (en) * | 2014-12-16 | 2018-12-21 | 意法半导体(鲁塞)公司 | Electronic equipment with the wake-up module different from core field |
CN109597477A (en) * | 2014-12-16 | 2019-04-09 | 意法半导体(鲁塞)公司 | Electronic equipment with the wake-up module different from core field |
US10955898B2 (en) * | 2014-12-16 | 2021-03-23 | Stmicroelectronics (Rousset) Sas | Electronic device with a wake up module distinct from a core domain |
US10001829B2 (en) * | 2014-12-16 | 2018-06-19 | Stmicroelectronics (Rousset) Sas | Electronic device comprising a wake up module distinct from a core domain |
CN105700660A (en) * | 2014-12-16 | 2016-06-22 | 意法半导体(鲁塞)公司 | Electronic Device Comprising a Wake Up Module Distinct From a Core Domain |
US20160170467A1 (en) * | 2014-12-16 | 2016-06-16 | Stmicroelectronics (Rousset) Sas | Electronic Device Comprising a Wake Up Module Distinct From a Core Domain |
US10469967B2 (en) | 2015-01-07 | 2019-11-05 | Knowler Electronics, LLC | Utilizing digital microphones for low power keyword detection and noise suppression |
US9830080B2 (en) | 2015-01-21 | 2017-11-28 | Knowles Electronics, Llc | Low power voice trigger for acoustic apparatus and method |
WO2016130212A1 (en) * | 2015-02-12 | 2016-08-18 | Apple Inc. | Clock switching in always-on component |
CN107210037A (en) * | 2015-02-12 | 2017-09-26 | 苹果公司 | It is always on the clock switching in part |
CN107210037B (en) * | 2015-02-12 | 2020-10-02 | 苹果公司 | Clock switching in always-on components |
US9653079B2 (en) * | 2015-02-12 | 2017-05-16 | Apple Inc. | Clock switching in always-on component |
EP3257045A4 (en) * | 2015-02-12 | 2018-08-15 | Apple Inc. | Clock switching in always-on component |
US20160240193A1 (en) * | 2015-02-12 | 2016-08-18 | Apple Inc. | Clock Switching in Always-On Component |
US20170213557A1 (en) * | 2015-02-12 | 2017-07-27 | Apple Inc. | Clock Switching in Always-On Component |
JP2018513397A (en) * | 2015-02-12 | 2018-05-24 | アップル インコーポレイテッド | Clock switching in always-on components |
TWI637321B (en) * | 2015-02-12 | 2018-10-01 | 美商蘋果公司 | Method, integrated circuit and system for audio |
KR101994569B1 (en) * | 2015-02-12 | 2019-06-28 | 애플 인크. | Clock Switching on Constant-On Components |
US9928838B2 (en) * | 2015-02-12 | 2018-03-27 | Apple Inc. | Clock switching in always-on component |
KR20170107492A (en) * | 2015-02-12 | 2017-09-25 | 애플 인크. | Clock Switching on Constant-On Components |
US10121472B2 (en) | 2015-02-13 | 2018-11-06 | Knowles Electronics, Llc | Audio buffer catch-up apparatus and method with two microphones |
US12027172B2 (en) * | 2015-02-16 | 2024-07-02 | Samsung Electronics Co., Ltd | Electronic device and method of operating voice recognition function |
US20160240194A1 (en) * | 2015-02-16 | 2016-08-18 | Samsung Electronics Co., Ltd. | Electronic device and method of operating voice recognition |
WO2016133316A1 (en) * | 2015-02-16 | 2016-08-25 | Samsung Electronics Co., Ltd. | Electronic device and method of operating voice recognition function |
US10679628B2 (en) * | 2015-02-16 | 2020-06-09 | Samsung Electronics Co., Ltd | Electronic device and method of operating voice recognition function |
US20200302938A1 (en) * | 2015-02-16 | 2020-09-24 | Samsung Electronics Co., Ltd. | Electronic device and method of operating voice recognition function |
US9866938B2 (en) | 2015-02-19 | 2018-01-09 | Knowles Electronics, Llc | Interface for microphone-to-microphone communications |
US10291973B2 (en) | 2015-05-14 | 2019-05-14 | Knowles Electronics, Llc | Sensor device with ingress protection |
US9883270B2 (en) | 2015-05-14 | 2018-01-30 | Knowles Electronics, Llc | Microphone with coined area |
EP3096534A1 (en) * | 2015-05-22 | 2016-11-23 | Nxp B.V. | Microphone control for power saving |
US9478234B1 (en) | 2015-07-13 | 2016-10-25 | Knowles Electronics, Llc | Microphone apparatus and method with catch-up buffer |
US9711144B2 (en) | 2015-07-13 | 2017-07-18 | Knowles Electronics, Llc | Microphone apparatus and method with catch-up buffer |
US10045104B2 (en) | 2015-08-24 | 2018-08-07 | Knowles Electronics, Llc | Audio calibration using a microphone |
US20170084292A1 (en) * | 2015-09-23 | 2017-03-23 | Samsung Electronics Co., Ltd. | Electronic device and method capable of voice recognition |
US10056096B2 (en) * | 2015-09-23 | 2018-08-21 | Samsung Electronics Co., Ltd. | Electronic device and method capable of voice recognition |
CN105307097A (en) * | 2015-10-09 | 2016-02-03 | 湖南康通电子科技有限公司 | Online monitoring method for audio power amplifier and system thereof |
WO2017090035A1 (en) * | 2015-11-23 | 2017-06-01 | Essence Smartcare Ltd. | Analog and digital microphone |
US10779099B2 (en) * | 2015-11-23 | 2020-09-15 | Essence Smartcare Ltd. | Analog and digital microphone |
US20180352356A1 (en) * | 2015-11-23 | 2018-12-06 | Essence Smartcare Ltd. | Analog and digital microphone |
US9894437B2 (en) | 2016-02-09 | 2018-02-13 | Knowles Electronics, Llc | Microphone assembly with pulse density modulated signal |
US20190124440A1 (en) * | 2016-02-09 | 2019-04-25 | Knowles Electronics, Llc | Microphone assembly with pulse density modulated signal |
US10721557B2 (en) * | 2016-02-09 | 2020-07-21 | Knowles Electronics, Llc | Microphone assembly with pulse density modulated signal |
US10165359B2 (en) | 2016-02-09 | 2018-12-25 | Knowles Electronics, Llc | Microphone assembly with pulse density modulated signal |
US20200344555A1 (en) * | 2016-02-29 | 2020-10-29 | Vesper Technologies Inc. | A piezoelectric mems device for producing a signal indicative of detection of an acoustic stimulus |
US11617041B2 (en) * | 2016-02-29 | 2023-03-28 | Qualcomm Incorporated | Piezoelectric MEMS device for producing a signal indicative of detection of an acoustic stimulus |
US10715922B2 (en) * | 2016-02-29 | 2020-07-14 | Vesper Technologies Inc. | Piezoelectric mems device for producing a signal indicative of detection of an acoustic stimulus |
WO2017151650A1 (en) | 2016-02-29 | 2017-09-08 | Littrell Robert J | A piezoelectric mems device for producing a signal indicative of detection of an acoustic stimulus |
EP4351170A3 (en) * | 2016-02-29 | 2024-07-03 | Qualcomm Technologies, Inc. | A piezoelectric mems device for producing a signal indicative of detection of an acoustic stimulus |
EP3424228A4 (en) * | 2016-02-29 | 2019-08-21 | Vesper Technologies Inc. | PIEZOELECTRIC MEMS DEVICE FOR PRODUCING A SIGNAL INDICATING THE DETECTION OF AN ACOUSTIC STIMULUS |
US20190098417A1 (en) * | 2016-02-29 | 2019-03-28 | Vesper Technologies Inc. | A piezoelectric mems device for producing a signal indicative of detection of an acoustic stimulus |
US10628172B2 (en) * | 2016-06-27 | 2020-04-21 | Qualcomm Incorporated | Systems and methods for using distributed universal serial bus (USB) host drivers |
US10678494B2 (en) | 2016-06-27 | 2020-06-09 | Qualcomm Incorporated | Controlling data streams in universal serial bus (USB) systems |
US11323805B2 (en) | 2016-07-05 | 2022-05-03 | Knowles Electronics, Llc. | Microphone assembly with digital feedback loop |
US10499150B2 (en) | 2016-07-05 | 2019-12-03 | Knowles Electronics, Llc | Microphone assembly with digital feedback loop |
US10880646B2 (en) | 2016-07-05 | 2020-12-29 | Knowles Electronics, Llc | Microphone assembly with digital feedback loop |
US10904672B2 (en) | 2016-07-22 | 2021-01-26 | Knowles Electronics, Llc | Digital microphone assembly with improved frequency response and noise characteristics |
US11304009B2 (en) | 2016-07-22 | 2022-04-12 | Knowles Electronics, Llc | Digital microphone assembly with improved frequency response and noise characteristics |
US10257616B2 (en) | 2016-07-22 | 2019-04-09 | Knowles Electronics, Llc | Digital microphone assembly with improved frequency response and noise characteristics |
US10979824B2 (en) | 2016-10-28 | 2021-04-13 | Knowles Electronics, Llc | Transducer assemblies and methods |
US11163521B2 (en) | 2016-12-30 | 2021-11-02 | Knowles Electronics, Llc | Microphone assembly with authentication |
US11025356B2 (en) | 2017-09-08 | 2021-06-01 | Knowles Electronics, Llc | Clock synchronization in a master-slave communication system |
US11061642B2 (en) | 2017-09-29 | 2021-07-13 | Knowles Electronics, Llc | Multi-core audio processor with flexible memory allocation |
CN111837179A (en) * | 2018-03-12 | 2020-10-27 | 赛普拉斯半导体公司 | System and method for capturing noise for pattern recognition processing |
CN110390957A (en) * | 2018-04-19 | 2019-10-29 | 半导体组件工业公司 | Method and apparatus for speech detection |
US11438682B2 (en) | 2018-09-11 | 2022-09-06 | Knowles Electronics, Llc | Digital microphone with reduced processing noise |
US10908880B2 (en) | 2018-10-19 | 2021-02-02 | Knowles Electronics, Llc | Audio signal circuit with in-place bit-reversal |
US11189262B2 (en) * | 2018-12-18 | 2021-11-30 | Baidu Online Network Technology (Beijing) Co., Ltd. | Method and apparatus for generating model |
US11930330B2 (en) | 2018-12-21 | 2024-03-12 | Minuendo As | System for monitoring sound |
US11418882B2 (en) * | 2019-03-14 | 2022-08-16 | Vesper Technologies Inc. | Piezoelectric MEMS device with an adaptive threshold for detection of an acoustic stimulus |
US12010488B2 (en) | 2019-03-14 | 2024-06-11 | Qualcomm Technologies, Inc. | Microphone having a digital output determined at different power consumption levels |
US20220394384A1 (en) * | 2019-03-14 | 2022-12-08 | Vesper Technologies Inc. | Piezoelectric mems device with an adaptive threshold for detection of an acoustic stimulus |
US11617048B2 (en) | 2019-03-14 | 2023-03-28 | Qualcomm Incorporated | Microphone having a digital output determined at different power consumption levels |
US11930334B2 (en) * | 2019-03-14 | 2024-03-12 | Qualcomm Technologies, Inc. | Piezoelectric MEMS device with an adaptive threshold for detection of an acoustic stimulus |
US12190891B2 (en) | 2019-04-01 | 2025-01-07 | Google Llc | Adaptive management of casting requests and/or user inputs at a rechargeable device |
US11120804B2 (en) | 2019-04-01 | 2021-09-14 | Google Llc | Adaptive management of casting requests and/or user inputs at a rechargeable device |
CN112073862A (en) * | 2019-06-10 | 2020-12-11 | 美商楼氏电子有限公司 | Audible keyword detection and method |
CN110220589A (en) * | 2019-06-20 | 2019-09-10 | 华电重工股份有限公司 | A kind of noise on-line measuring device and system |
US11892466B2 (en) | 2019-06-26 | 2024-02-06 | Qualcomm Technologies, Inc. | Piezoelectric accelerometer with wake function |
US11899039B2 (en) | 2019-06-26 | 2024-02-13 | Qualcomm Technologies, Inc. | Piezoelectric accelerometer with wake function |
US11726105B2 (en) | 2019-06-26 | 2023-08-15 | Qualcomm Incorporated | Piezoelectric accelerometer with wake function |
WO2022047808A1 (en) * | 2020-09-07 | 2022-03-10 | 华为技术有限公司 | Image processing apparatus, electronic device, and image processing method |
CN112423195A (en) * | 2020-11-03 | 2021-02-26 | 广东公信智能会议股份有限公司 | Self-checking device and method for audio transmission |
WO2022151651A1 (en) * | 2021-01-13 | 2022-07-21 | 神盾股份有限公司 | Speech assistant system |
CN114828170A (en) * | 2021-01-27 | 2022-07-29 | 华为技术有限公司 | Method and device for improving endurance time of communication device |
US20240129370A1 (en) * | 2021-03-03 | 2024-04-18 | Telefonaktiebolaget Lm Ericsson (Publ) | A computer software module arrangement, a circuitry arrangement, an arrangement and a method for an improved user interface for internet of things devices |
WO2022193056A1 (en) * | 2021-03-15 | 2022-09-22 | 华为技术有限公司 | Media processing device and method |
CN113726467A (en) * | 2021-07-29 | 2021-11-30 | 黎兴荣 | Electronic product data transmission method, system, storage medium and program product |
US20230215450A1 (en) * | 2022-01-06 | 2023-07-06 | Tymphany Worldwide Enterprises Limited | Automatic noise gating |
WO2025019147A1 (en) * | 2023-07-16 | 2025-01-23 | Cirrus Logic International Semiconductor Ltd. | Clock signal transitioning for multi-mode audio processing systems |
US12261611B2 (en) | 2023-07-16 | 2025-03-25 | Cirrus Logic, Inc. | Clock signal transitioning for multi-mode audio processing systems |
Also Published As
Publication number | Publication date |
---|---|
GB2501367B (en) | 2018-01-03 |
DE102013003273A1 (en) | 2013-08-29 |
GB2501367A (en) | 2013-10-23 |
US9838810B2 (en) | 2017-12-05 |
GB201303502D0 (en) | 2013-04-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9838810B2 (en) | Low power audio detection | |
US11676581B2 (en) | Method and apparatus for evaluating trigger phrase enrollment | |
US10909977B2 (en) | Apparatus and method for power efficient signal conditioning for a voice recognition system | |
US9549273B2 (en) | Selective enabling of a component by a microphone circuit | |
US9703350B2 (en) | Always-on low-power keyword spotting | |
US9992745B2 (en) | Extraction and analysis of buffered audio data using multiple codec rates each greater than a low-power processor rate | |
CN104867495B (en) | Sound recognition apparatus and method of operating the same | |
EP2649844B1 (en) | Processing involving multiple sensors | |
US9406313B2 (en) | Adaptive microphone sampling rate techniques | |
TWI474317B (en) | Signal processing apparatus and signal processing method | |
CN104216677A (en) | Low-power voice gate for device wake-up | |
EP3443440B1 (en) | Waking computing devices based on ambient noise | |
CN106775569B (en) | Device position prompting system and method | |
CN114005198B (en) | Door lock pre-awakening method and device, electronic equipment and storage medium | |
CN104049707B (en) | Always-on low-power keyword detection | |
US10204504B1 (en) | Electronic device and drop warning method | |
CN110703895B (en) | Control method and device | |
CN112435441B (en) | Sleep detection method and wearable electronic device | |
CN116416979A (en) | Cascade audio detection system | |
US20210391840A1 (en) | Audio gain selection | |
CN114913879A (en) | Voice data processing method and device, voice data processing system and electronic equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: CAMBRIDGE SILICON RADIO LIMITED, UNITED KINGDOM Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:SINGER, STEVEN MARK;HABOUBI, HARITH;WILLIAMS, PETER;SIGNING DATES FROM 20130408 TO 20130409;REEL/FRAME:030195/0626 |
|
AS | Assignment |
Owner name: QUALCOMM TECHNOLOGIES INTERNATIONAL, LTD., UNITED Free format text: CHANGE OF NAME;ASSIGNOR:CAMBRIDGE SILICON RADIO LIMITED;REEL/FRAME:036663/0211 Effective date: 20150813 |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
FEPP | Fee payment procedure |
Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
LAPS | Lapse for failure to pay maintenance fees |
Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
STCH | Information on status: patent discontinuation |
Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
|
FP | Lapsed due to failure to pay maintenance fee |
Effective date: 20211205 |