US20080160943A1 - Method and apparatus to post-process an audio signal - Google Patents

Method and apparatus to post-process an audio signal Download PDF

Info

Publication number
US20080160943A1
US20080160943A1 US11/755,086 US75508607A US2008160943A1 US 20080160943 A1 US20080160943 A1 US 20080160943A1 US 75508607 A US75508607 A US 75508607A US 2008160943 A1 US2008160943 A1 US 2008160943A1
Authority
US
United States
Prior art keywords
audio signal
frequency band
processing unit
khz
processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US11/755,086
Inventor
Ho-young Sung
Jae-Cheol Lee
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Samsung Electronics Co Ltd
Original Assignee
Samsung Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Samsung Electronics Co Ltd filed Critical Samsung Electronics Co Ltd
Assigned to SAMSUNG ELECTRONICS CO., LTD. reassignment SAMSUNG ELECTRONICS CO., LTD. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: LEE, JAE-CHEOL, SUNG, HO-YOUNG
Publication of US20080160943A1 publication Critical patent/US20080160943A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems
    • H04S1/002Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/033Voice editing, e.g. manipulating the voice of the synthesiser
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing

Definitions

  • the present general inventive concept relates to a method and apparatus to post-process an audio signal.
  • the equalizer filters allow sound reproduction devices to output an equalized audio signal corresponding to a selected genre by adjusting a frequency characteristic of an audio signal.
  • music classification according to moods considers not only the frequency characteristic but also timbre, sound intensity, and rhythm of an audio signal.
  • the present general inventive concept provides a method and apparatus to post-process an audio signal, whereby an audio signal can be adjusted based on music classification according to moods.
  • a method of post-processing an audio signal including selecting one of a plurality of audio signal processing modes classified based on at least one of timbre, sound intensity, and rhythm of the audio signal, and modifying at least one of the timbre, the sound intensity, and the rhythm of the audio signal based on the selected audio signal processing mode.
  • the modifying may include increasing or decreasing a sampling rate of the audio signal.
  • the modifying may include converting the audio signal to an audio signal of a frequency domain, moving a position of the audio signal of the frequency domain to a higher frequency band by a predetermined frequency, and amplifying audio signal energy contained in a predetermined frequency band of the moved audio signal.
  • the amplifying of the audio signal energy may include amplifying audio signal energy contained in a frequency band between 2 KHz and 3 KHz.
  • the modifying may include converting the audio signal to an audio signal of a frequency domain, moving a position of the audio signal of the frequency domain to a lower frequency band by a predetermined frequency, and reducing audio signal energy contained in a predetermined frequency band of the moved audio signal.
  • the reducing of the audio signal energy may include reducing audio signal energy contained in a frequency band between 2 KHz and 20 KHz.
  • the modifying may include converting the audio signal to an audio signal of a frequency domain, widening a dynamic range of the audio signal of the frequency domain, and amplifying audio signal energy contained in a predetermined frequency band of the audio signal whose dynamic range is widened.
  • the amplifying of the audio signal energy may include amplifying audio signal energy contained in a predetermined frequency band based on 80 Hz and a predetermined frequency band based on 5 KHz.
  • the modifying may include converting the audio signal to an audio signal of a frequency domain, narrowing a dynamic range of the audio signal of the frequency domain, and amplifying audio signal energy contained in a predetermined frequency band of the audio signal whose dynamic range is narrowed.
  • the amplifying of the audio signal energy may include amplifying audio signal energy contained in a frequency band between 6 KHz and 20 KHz.
  • an apparatus to post-process an audio signal, the apparatus including a mode selector to select one of a plurality of audio signal processing modes classified based on at least one of timbre, sound intensity, and rhythm of the audio signal, and a processing unit to modify at least one of the timbre, the sound intensity, and the rhythm of the audio signal based on the selected audio signal processing mode.
  • a computer readable recording medium storing a computer readable program to execute a method of post-processing an audio signal, the method including selecting one of a plurality of audio signal processing modes classified based on at least one of timbre, sound intensity, and rhythm of the audio signal, and modifying at least one of the timbre, the sound intensity, and the rhythm of the audio signal based on the selected audio signal processing mode.
  • an apparatus to post-process an audio signal including a mode selector to select one of a plurality of audio signal processing modes based on a plurality of different human emotions, and a processing unit to modify at least one of a frequency, a dynamic range, and a sampling rate of the audio signal based on the selected audio signal processing mode.
  • the plurality of different human emotions may include at least one of exuberance, depression, anxiety, and contentment.
  • the processing unit may move a position of the audio signal to a higher frequency band and amplify the audio signal when the audio signal processing mode corresponding to the human emotion of exuberance is selected.
  • the processing unit may amplify audio signal energy contained in a frequency band between 2 KHz and 3 KHz.
  • the processing unit may move a position of the audio signal to a lower frequency band and reduce the audio signal when the audio signal processing mode corresponding to the human emotion of depression is selected.
  • the processing unit may amplify audio signal energy contained in a frequency band between 2 KHz and 20 KHz.
  • the processing unit may widen a dynamic range of the audio signal and amplify the audio signal when the audio signal processing mode corresponding to the human emotion of anxiety is selected.
  • the processing unit may amplify audio signal energy contained in a frequency band between 80 KHz and 5 KHz.
  • the processing unit may narrow a dynamic range of the audio signal and amplify the audio signal when the audio signal processing mode corresponding to the human emotion of contentment is selected.
  • the processing unit may amplify audio signal energy contained in a frequency band between 6 KHz and 20 KHz.
  • the plurality of human emotions may include at least one of happiness, sadness, restlessness, and serenity.
  • the audio signal processing modes may be based on the Thayer mood model.
  • an apparatus to post-process an audio signal including a mode selector to select one of a plurality of audio signal processing modes based on at least one of timbre, sound intensity, and rhythm of the audio signal, and a processing unit to adjust at least one of a frequency, a dynamic range, and a sampling rate of the audio signal based on the selected audio signal processing mode.
  • the processing unit may move a position of the audio signal to a higher frequency band and amplify the audio signal in a first mode.
  • the processing unit may move a position of the audio signal to a lower frequency band and reduce the audio signal in a second mode.
  • the processing unit may widen a dynamic range of the audio signal and amplify the audio signal in a third mode.
  • the processing unit may narrow a dynamic range of the audio signal and amplify the audio signal in a fourth mode.
  • the foregoing and/or other aspects and utilities of the present general inventive concept may also be achieved by providing a method of post-processing an audio signal, the method comprising selecting one of a plurality of audio signal processing modes based on a plurality of different human emotions, and modifying at least one of a frequency, a dynamic range, and a sampling rate of the audio signal based on the selected audio signal processing mode.
  • the plurality of different human emotions may include at least one of exuberance, depression, anxiety, and contentment.
  • the foregoing and/or other aspects and utilities of the present general inventive concept may also be achieved by providing a method of post-processing an audio signal, the method including selecting one of a plurality of audio signal processing modes based on at least one of timbre, sound intensity, and rhythm of the audio signal, and adjusting at least one of a frequency, a dynamic range, and a sampling rate of the audio signal based on the selected audio signal processing mode.
  • FIG. 1 is a block diagram of an apparatus to post-process an audio signal according to an embodiment of the present general inventive concept
  • FIG. 2 is a diagram illustrating the Thayer mood model
  • FIG. 3 is a table illustrating physical properties of processing modes and processing methods thereof according to an embodiment of the present general inventive concept
  • FIG. 4 is a diagram illustrating an audio signal whose position is moved to a higher frequency band according to an embodiment of the present general inventive concept
  • FIG. 5 is a diagram illustrating a characteristic of a filter used by a processing unit in a first processing mode according to an embodiment of the present general inventive concept
  • FIG. 6 is a diagram illustrating an audio signal whose position is moved to a lower frequency band according to an embodiment of the present general inventive concept
  • FIG. 7 is a diagram illustrating a characteristic of a filter used by a processing unit in a second processing mode according to an embodiment of the present general inventive concept
  • FIG. 8 is a diagram illustrating an audio signal whose dynamic range is widened according to an embodiment of the present general inventive concept
  • FIG. 9 is a diagram illustrating a characteristic of a filter used by a processing unit in a third processing mode according to an embodiment of the present general inventive concept
  • FIG. 10 is a diagram illustrating a method of generating an audio signal having a slow-tempo rhythm in a fourth processing mode according to an embodiment of the present general inventive concept
  • FIG. 11 is a diagram illustrating an audio signal whose dynamic range is narrowed according to an embodiment of the present general inventive concept.
  • FIG. 12 is a diagram illustrating a characteristic of a filter used by a processing unit in a fourth processing mode according to an embodiment of the present general inventive concept.
  • FIG. 1 is a block diagram of an apparatus to post-process an audio signal according to an embodiment of the present general inventive concept.
  • the apparatus includes a mode selector 110 and a processing unit 120 .
  • the mode selector 110 selects one of a plurality of processing modes of the audio signal, which can be classified based on timbre, sound intensity, and rhythm of the audio signal.
  • the processing modes are not limited thereto, and can include other processing modes of an audio signal.
  • One of the plurality of processing modes can be selected when a user inputs a signal to select a predetermined processing mode using an input unit.
  • each processing mode defines physical properties, such as the timbre, the sound intensity, and the rhythm of the audio signal, in order to provide four sound sensations which are based on human emotions.
  • the physical properties of each processing mode are set based on the Thayer mood model.
  • FIG. 2 is a diagram illustrating the Thayer mood model.
  • the Thayer mood model defines the four processing modes by analyzing how stress and energy influence human emotions. That is, when a listener listens to music having a high energy level, an amount of epinephrine which is secreted in the listener's brain increases, and thus, the listener experiences both a desire to move or be active and a feeling of exuberance. Accordingly, the listener may be moved to sing, beat time, or dance along the music. In contrast, music having a high stress level causes a listener's brain to secrete cortisol, which is commonly known as a hormone associated with stress. Accordingly, the listener may experience an unstable emotion, such as depression or anxiety.
  • Thayer defined the four processing modes using the correlations described above, wherein a first mode represents “exuberance”, a second mode “depression”, a third mode “anxious/frantic”, and a fourth mode “contentment”.
  • the first mode representing “exuberance” is a mode which is set to represent a pleasantly-energetic and joyful (i.e., happy) emotion.
  • An audio signal in the “exuberance” mode is in a state of low stress and high energy. Therefore, the “exuberance” mode can be noisy and vigorous due to dynamic and high energy qualities of the audio signal, but can still exude a low level of stress on a listener due to a low level of stress qualities of the audio signal.
  • An example of music providing the emotion of the first mode is Rossini's “William Tell Overture.”
  • the second mode representing “depression” is a mode which is set to represent a calm and somewhat sad emotion.
  • An audio signal in the “depression” mode is in a state of high stress and low energy.
  • An example of music providing the emotion of the second mode is Stravinsky's “Firebird.”
  • the third mode representing “anxious/frantic” is a mode which is set to represent an eruptive and frenzied emotion (e.g., restlessness).
  • An audio signal in the “anxious/frantic” mode is in a state of high stress and high energy.
  • An example of music providing the emotion of the third mode is Berg's “Lulu.”
  • general “rock,” heavy-metal,” and “punk” music correspond to the third mode.
  • the fourth mode representing “contentment” is a mode which is set to represent a very joyful, serene, and positive emotion.
  • An audio signal in the “contentment” mode is in a state of low stress and low energy.
  • An example of music providing the emotion of the fourth mode is Bach's “Jesus, Joy of Man's Desire.”
  • Each of the four processing modes can be defined by the physical properties mentioned above, such as a tone (a major or a minor), timbre, sound intensity, and rhythm of an audio signal.
  • tone will be excluded due to difficulty in tone identification. Therefore, in the present embodiment, each of the four modes is defined using the three properties mentioned above, excluding the tone, and processing is performed according to a selected mode.
  • the timbre and the rhythm of the audio signal are related to the stress level, and the sound intensity of the audio signal is related to the energy level.
  • the sound intensity uses Root Mean Square (RMS) power to form the audio signal, where high sound intensity makes the sound heard high, and low sound intensity makes the sound heard low.
  • RMS Root Mean Square
  • the timbre indicates whether a sound produced by the audio signal is bright or heavy, or whether a sound pitch is high or low.
  • the rhythm indicates whether a tempo of the audio signal is quick or slow, or whether strength of a sound produced by the audio signal is strong or weak.
  • the processing unit 120 of FIG. 2 can modify, for example, the timbre, the sound intensity, and the rhythm of the audio signal based on the selected processing mode.
  • the audio signal may be output in real-time through a radio or other media player.
  • the audio signal can be in a stored file, such as an mp3 file, to be output at a later time by a media player.
  • FIG. 3 is a table illustrating physical properties of processing modes and processing methods thereof according to an embodiment of the present general inventive concept.
  • a first processing mode is a mode representing “exuberance,” and is set to process an audio signal to have intermediate sound intensity, high-pitch timbre, and a quick-tempo rhythm.
  • the processing unit 120 of FIG. 2 has two processing methods to implement the physical properties set as the first processing mode, wherein one is a processing method in a time domain, and the other is a processing method in a frequency domain.
  • the processing unit 120 increases a sampling rate of the audio signal.
  • the sampling rate is a rate of converting a sound to samples corresponding to 1 second.
  • a sampling rate of 44.1 KHz means that a sound is sampled 44100 times for 1 second.
  • An increase of the sampling rate means that more samples are obtained for 1 second. Why the sampling rate is increased is because an increase of the sampling rate results in high-pitch timbre and a quick tempo.
  • the method of increasing the sampling rate cannot be applied to an audio signal which is output in real-time, but can be applied to a stored audio signal, such as an mp3 file.
  • the processing unit 120 moves a position of the audio signal to a higher frequency band and amplifies audio signal energy corresponding to a frequency band between 2 KHz and 3 KHz.
  • FIG. 4 is a diagram illustrating an audio signal whose position is moved to a higher frequency band according to an embodiment of the present general inventive concept.
  • the audio signal 420 can be obtained by multiplying a frequency of the original audio signal 410 by 2.
  • a signal located at 20 Hz is moved to 40 Hz, and a signal located at 1 KHz is moved to 2 KHz. That is, if an audio signal is moved to a higher frequency band, the audio signal has high-pitch timbre.
  • the method of moving a frequency band is not limited thereto. That is, all methods which can move a frequency band can be used.
  • FIG. 5 is a diagram illustrating a characteristic of a filter which is used by the processing unit 120 of FIG. 2 in the first processing mode of FIG. 3 , according to an embodiment of the present general inventive concept.
  • FIG. 5 illustrates a graph in which a function value gradually decreases based on 2 KHz. If an audio signal is filtered using the filter in the first processing mode, as illustrated in FIG. 5 , energy of the audio signal in the frequency band between 2 KHz and 3 KHz can be increased.
  • a second processing mode is a mode representing “depression,” and is set to process an audio signal to have low sound intensity, low-pitch timbre, and a slow-tempo rhythm.
  • the processing unit 120 has two processing methods to implement the physical properties in the second processing mode, wherein one is a processing method in the time domain, and the other is a processing method in the frequency domain.
  • the processing unit 120 decreases a sampling rate of the audio signal.
  • the sampling rate is decreased because a decrease of the sampling rate results in low-pitch timbre and a slow tempo.
  • the method of decreasing the sampling rate cannot be applied to an audio signal which is output in real-time, but can be applied to a stored audio signal, such as an mp3 file.
  • the processing unit 120 moves a position of the audio signal to a lower frequency band and reduces audio signal energy corresponding to a frequency band between 2 KHz and 20 KHz.
  • the movement and the reduction operations of the audio signal may be performed in any order.
  • FIG. 6 is a diagram illustrating an audio signal whose position is moved to a lower frequency band according to an embodiment of the present general inventive concept.
  • an original audio signal 610 which is represented by a sine wave and an audio signal 620 which is obtained by moving a position of the original audio signal 610 are illustrated.
  • the audio signal 620 can be obtained by multiplying a frequency of the original audio signal 610 by 0.5.
  • a signal which located at 20 Hz is moved to 10 Hz, and a signal located at 1 KHz is moved to 500 Hz. That is, if an audio signal is moved to a lower frequency band, the audio signal has low-pitch timbre.
  • FIG. 7 is a diagram illustrating a characteristic of a filter used by the processing unit 120 of FIG. 2 in the second processing mode of FIG. 3 , according to an embodiment of the present general inventive concept.
  • FIG. 7 illustrates a graph in which a function value in the frequency band between 2 KHz and 20 KHz gradually decreases according to an increase of a frequency. If an audio signal is filtered using the filter in the second processing mode, as illustrated in FIG. 7 , energy of the audio signal in the frequency band between 2 KHz and 20 KHz can be decreased.
  • a third processing mode is a mode representing an “anxious/frantic” emotion, and is set to process an audio signal to have high sound intensity and a strong rhythm.
  • the processing unit 120 widens a dynamic range of the audio signal and amplifies audio signal energy corresponding to frequency bands around 80 Hz and 5 KHz.
  • the widening and the amplification operations of the audio signal may be performed in any order.
  • FIG. 8 is a diagram illustrating an audio signal whose dynamic range is widened according to an embodiment of the present general inventive concept.
  • the dynamic range is a range between a function value in which energy of an audio signal is maximized and a function value in which the energy of the audio signal is minimized.
  • a dynamic range of an original audio signal 810 is between 40 dB and 80 dB
  • the processing unit 120 of FIG. 2 widens the dynamic range to a range between 20 dB and 100 dB as illustrated by a widened audio signal 820 . If the dynamic range is widened, an entire energy level can be increased while maintaining musical quality.
  • FIG. 9 is a diagram illustrating a characteristic of a filter which is used by the processing unit 120 of FIG. 2 in the third processing mode of FIG. 3 , according to an embodiment of the present general inventive concept.
  • FIG. 9 illustrates a graph in which magnitudes of function values in frequency bands around 80 Hz and 5 KHz are relatively greater. If an audio signal is filtered using the filter in the third processing mode, as illustrated in FIG. 9 , energy of the audio signal in the frequency bands around 80 Hz and 5 KHz can be increased.
  • a fourth processing mode is a mode representing “contentment,” and is set to process an audio signal to have physical properties of low sound intensity, bright timbre, and a slow-tempo rhythm.
  • the processing unit 120 narrows a dynamic range of the audio signal and) amplifies audio signal energy corresponding to a frequency band between 6 KHz and 20 KHz.
  • the narrowing and the amplification operations of the audio signal may be performed in any order.
  • the processing unit 120 can implement a low-tempo rhythm in a method of reducing a time domain of the audio signal processed by the narrowing and amplification operations of the audio signal using a Time Domain Pitch Synchronous Overlap-Add (TD-PSOLA) method.
  • TD-PSOLA Time Domain Pitch Synchronous Overlap-Add
  • the TD-PSOLA method cannot be applied to an audio signal which is output in real-time, but can be applied to a stored audio signal, such as an mp3 file.
  • FIG. 10 is a diagram illustrating a method of generating an audio signal having a slow-tempo rhythm in the fourth processing mode of FIG. 3 , according to an embodiment of the present general inventive concept.
  • FIG. 10 is a diagram illustrating a schematic concept of the TD-PSOLA method and illustrates a method of decreasing tempo speed by processing an audio signal having a reproduction time of 3 minutes to have a reproduction time of 4 minutes.
  • an entire duration of an audio signal is divided into 3 time durations A, B, and C (as denoted by reference number 1010 ).
  • Two time durations D and E are added to the 3 time durations A, B, and C (as denoted by reference number 1020 ).
  • the two additional time durations D and E can be added using a mean value of the audio signal or a value which is obtained by copying an audio signal before each of the additional time durations are added.
  • An audio signal having a reproduction time of 4 minutes is generated by synthesizing the five time durations (as denoted by reference number 1030 ).
  • FIG. 11 is a diagram illustrating an audio signal whose dynamic range is narrowed according to an embodiment of the present general inventive concept.
  • the processing unit 120 reduces the dynamic range to a range between 50 dB and 70 dB illustrated by a narrowed audio signal 1120 . If a dynamic range is narrowed, an entire energy level can be decreased due to a decrease of an alteration range of the audio signal.
  • FIG. 12 is a diagram illustrating a characteristic of a filter used by the processing unit 120 of FIG. 2 in the fourth processing mode of FIG. 3 , according to an embodiment of the present general inventive concept.
  • FIG. 12 illustrates a graph in which a function value in a frequency band above 6 KHz gradually increases according to an increase of a frequency. If an audio signal is filtered using the filter in the fourth processing mode, as illustrated in FIG. 12 , energy of the audio signal in the frequency band above 6 KHz can be increased, resulting in bright timbre of the audio signal.
  • the present general inventive concept can also be embodied as computer-readable codes on a computer-readable medium.
  • the computer-readable medium can include a computer-readable recording medium and a computer-readable transmission medium.
  • the computer-readable recording medium is any data storage device that can store data which can be thereafter read by a computer system. Examples of the computer-readable recording medium include read-only memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks, and optical data storage devices.
  • the computer-readable recording medium can also be distributed over network coupled computer systems so that the computer-readable code is stored and executed in a distributed fashion.
  • the computer-readable transmission medium can transmit carrier waves or signals (e.g., wired or wireless data transmission through the Internet). Also, functional programs, codes, and code segments to accomplish the present general inventive concept can be easily construed by programmers skilled in the art to which the present general inventive concept pertains.
  • the audio signal by modifying timbre, sound intensity, and rhythm of an audio signal, based on audio signal processing modes which are classified based on the timbre, the sound intensity, and the rhythm of the audio signal, the audio signal can be adjusted based on music classification according to moods.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Electrophonic Musical Instruments (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

A method and apparatus to post process an audio signal. The method includes selecting one of a plurality of audio signal processing modes classified based on at least one of timbre, sound intensity, and rhythm of the audio signal, and modifying at least one of the timbre, the sound intensity, and the rhythm of the audio signal based on the selected audio signal processing mode.

Description

    CROSS-REFERENCE TO RELATED APPLICATIONS
  • This application claims priority under 35 U.S.C. §119(a) from Korean Patent Application No. 10-2006-0134984, filed on Dec. 27, 2006, in the Korean Intellectual Property Office, the disclosure of which is incorporated herein in its entirety by reference.
  • BACKGROUND OF THE INVENTION
  • 1. Field of the Invention
  • The present general inventive concept relates to a method and apparatus to post-process an audio signal.
  • 2. Description of the Related Art
  • Music has been classified using genre information (classic, jazz, rock, etc.) which is provided by producers, and most sound reproduction devices have adopted equalizer filters which are based on genre classification. The equalizer filters allow sound reproduction devices to output an equalized audio signal corresponding to a selected genre by adjusting a frequency characteristic of an audio signal.
  • However, since a number of digital music sources is constantly increasing, thereby resulting in an enormous music database, audiences need various access methods to search for desired music. One of the access methods includes classifying music according to moods. Audio sensitivity has become an increasingly important element for customers, and thus, a large variety of sound effects of an audio signal is necessary.
  • Unlike conventional music classification methods, music classification according to moods considers not only the frequency characteristic but also timbre, sound intensity, and rhythm of an audio signal.
  • However, according to conventional equalizing methods, since only the frequency characteristic can be adjusted, an audio signal cannot be adjusted based on the music classification according to moods.
  • SUMMARY OF THE INVENTION
  • The present general inventive concept provides a method and apparatus to post-process an audio signal, whereby an audio signal can be adjusted based on music classification according to moods.
  • Additional aspects and utilities of the present general inventive concept will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the general inventive concept.
  • The foregoing and/or other aspects and utilities of the present general inventive concept are achieved by providing a method of post-processing an audio signal, the method including selecting one of a plurality of audio signal processing modes classified based on at least one of timbre, sound intensity, and rhythm of the audio signal, and modifying at least one of the timbre, the sound intensity, and the rhythm of the audio signal based on the selected audio signal processing mode.
  • The modifying may include increasing or decreasing a sampling rate of the audio signal.
  • The modifying may include converting the audio signal to an audio signal of a frequency domain, moving a position of the audio signal of the frequency domain to a higher frequency band by a predetermined frequency, and amplifying audio signal energy contained in a predetermined frequency band of the moved audio signal.
  • The amplifying of the audio signal energy may include amplifying audio signal energy contained in a frequency band between 2 KHz and 3 KHz.
  • The modifying may include converting the audio signal to an audio signal of a frequency domain, moving a position of the audio signal of the frequency domain to a lower frequency band by a predetermined frequency, and reducing audio signal energy contained in a predetermined frequency band of the moved audio signal.
  • The reducing of the audio signal energy may include reducing audio signal energy contained in a frequency band between 2 KHz and 20 KHz.
  • The modifying may include converting the audio signal to an audio signal of a frequency domain, widening a dynamic range of the audio signal of the frequency domain, and amplifying audio signal energy contained in a predetermined frequency band of the audio signal whose dynamic range is widened.
  • The amplifying of the audio signal energy may include amplifying audio signal energy contained in a predetermined frequency band based on 80 Hz and a predetermined frequency band based on 5 KHz.
  • The modifying may include converting the audio signal to an audio signal of a frequency domain, narrowing a dynamic range of the audio signal of the frequency domain, and amplifying audio signal energy contained in a predetermined frequency band of the audio signal whose dynamic range is narrowed.
  • The amplifying of the audio signal energy may include amplifying audio signal energy contained in a frequency band between 6 KHz and 20 KHz.
  • The foregoing and/or other aspects and utilities of the present general inventive concept may also be achieved by providing an apparatus to post-process an audio signal, the apparatus including a mode selector to select one of a plurality of audio signal processing modes classified based on at least one of timbre, sound intensity, and rhythm of the audio signal, and a processing unit to modify at least one of the timbre, the sound intensity, and the rhythm of the audio signal based on the selected audio signal processing mode.
  • The foregoing and/or other aspects and utilities of the present general inventive concept may also be achieved by providing a computer readable recording medium storing a computer readable program to execute a method of post-processing an audio signal, the method including selecting one of a plurality of audio signal processing modes classified based on at least one of timbre, sound intensity, and rhythm of the audio signal, and modifying at least one of the timbre, the sound intensity, and the rhythm of the audio signal based on the selected audio signal processing mode.
  • The foregoing and/or other aspects and utilities of the present general inventive concept may also be achieved by providing an apparatus to post-process an audio signal, the apparatus including a mode selector to select one of a plurality of audio signal processing modes based on a plurality of different human emotions, and a processing unit to modify at least one of a frequency, a dynamic range, and a sampling rate of the audio signal based on the selected audio signal processing mode.
  • The plurality of different human emotions may include at least one of exuberance, depression, anxiety, and contentment.
  • The processing unit may move a position of the audio signal to a higher frequency band and amplify the audio signal when the audio signal processing mode corresponding to the human emotion of exuberance is selected.
  • The processing unit may amplify audio signal energy contained in a frequency band between 2 KHz and 3 KHz.
  • The processing unit may move a position of the audio signal to a lower frequency band and reduce the audio signal when the audio signal processing mode corresponding to the human emotion of depression is selected.
  • The processing unit may amplify audio signal energy contained in a frequency band between 2 KHz and 20 KHz.
  • The processing unit may widen a dynamic range of the audio signal and amplify the audio signal when the audio signal processing mode corresponding to the human emotion of anxiety is selected.
  • The processing unit may amplify audio signal energy contained in a frequency band between 80 KHz and 5 KHz.
  • The processing unit may narrow a dynamic range of the audio signal and amplify the audio signal when the audio signal processing mode corresponding to the human emotion of contentment is selected.
  • The processing unit may amplify audio signal energy contained in a frequency band between 6 KHz and 20 KHz.
  • The plurality of human emotions may include at least one of happiness, sadness, restlessness, and serenity.
  • The audio signal processing modes may be based on the Thayer mood model.
  • The foregoing and/or other aspects and utilities of the present general inventive concept may also be achieved by providing an apparatus to post-process an audio signal, the apparatus including a mode selector to select one of a plurality of audio signal processing modes based on at least one of timbre, sound intensity, and rhythm of the audio signal, and a processing unit to adjust at least one of a frequency, a dynamic range, and a sampling rate of the audio signal based on the selected audio signal processing mode.
  • The processing unit may move a position of the audio signal to a higher frequency band and amplify the audio signal in a first mode.
  • The processing unit may move a position of the audio signal to a lower frequency band and reduce the audio signal in a second mode.
  • The processing unit may widen a dynamic range of the audio signal and amplify the audio signal in a third mode.
  • The processing unit may narrow a dynamic range of the audio signal and amplify the audio signal in a fourth mode.
  • The foregoing and/or other aspects and utilities of the present general inventive concept may also be achieved by providing a method of post-processing an audio signal, the method comprising selecting one of a plurality of audio signal processing modes based on a plurality of different human emotions, and modifying at least one of a frequency, a dynamic range, and a sampling rate of the audio signal based on the selected audio signal processing mode.
  • The plurality of different human emotions may include at least one of exuberance, depression, anxiety, and contentment.
  • The foregoing and/or other aspects and utilities of the present general inventive concept may also be achieved by providing a method of post-processing an audio signal, the method including selecting one of a plurality of audio signal processing modes based on at least one of timbre, sound intensity, and rhythm of the audio signal, and adjusting at least one of a frequency, a dynamic range, and a sampling rate of the audio signal based on the selected audio signal processing mode.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • These and/or other aspects and utilities of the present general inventive concept will become apparent and more readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:
  • FIG. 1 is a block diagram of an apparatus to post-process an audio signal according to an embodiment of the present general inventive concept;
  • FIG. 2 is a diagram illustrating the Thayer mood model;
  • FIG. 3 is a table illustrating physical properties of processing modes and processing methods thereof according to an embodiment of the present general inventive concept;
  • FIG. 4 is a diagram illustrating an audio signal whose position is moved to a higher frequency band according to an embodiment of the present general inventive concept;
  • FIG. 5 is a diagram illustrating a characteristic of a filter used by a processing unit in a first processing mode according to an embodiment of the present general inventive concept;
  • FIG. 6 is a diagram illustrating an audio signal whose position is moved to a lower frequency band according to an embodiment of the present general inventive concept;
  • FIG. 7 is a diagram illustrating a characteristic of a filter used by a processing unit in a second processing mode according to an embodiment of the present general inventive concept;
  • FIG. 8 is a diagram illustrating an audio signal whose dynamic range is widened according to an embodiment of the present general inventive concept;
  • FIG. 9 is a diagram illustrating a characteristic of a filter used by a processing unit in a third processing mode according to an embodiment of the present general inventive concept;
  • FIG. 10 is a diagram illustrating a method of generating an audio signal having a slow-tempo rhythm in a fourth processing mode according to an embodiment of the present general inventive concept;
  • FIG. 11 is a diagram illustrating an audio signal whose dynamic range is narrowed according to an embodiment of the present general inventive concept; and
  • FIG. 12 is a diagram illustrating a characteristic of a filter used by a processing unit in a fourth processing mode according to an embodiment of the present general inventive concept.
  • DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
  • Reference will now be made in detail to the embodiments of the present general inventive concept, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to the like elements throughout. The embodiments are described below in order to explain the present general inventive concept by referring to the figures.
  • FIG. 1 is a block diagram of an apparatus to post-process an audio signal according to an embodiment of the present general inventive concept.
  • Referring to FIG. 1, the apparatus includes a mode selector 110 and a processing unit 120.
  • The mode selector 110 selects one of a plurality of processing modes of the audio signal, which can be classified based on timbre, sound intensity, and rhythm of the audio signal. However, the processing modes are not limited thereto, and can include other processing modes of an audio signal.
  • One of the plurality of processing modes can be selected when a user inputs a signal to select a predetermined processing mode using an input unit.
  • In the current embodiment, four processing modes are supported, where each processing mode defines physical properties, such as the timbre, the sound intensity, and the rhythm of the audio signal, in order to provide four sound sensations which are based on human emotions. The physical properties of each processing mode are set based on the Thayer mood model.
  • FIG. 2 is a diagram illustrating the Thayer mood model.
  • The Thayer mood model defines the four processing modes by analyzing how stress and energy influence human emotions. That is, when a listener listens to music having a high energy level, an amount of epinephrine which is secreted in the listener's brain increases, and thus, the listener experiences both a desire to move or be active and a feeling of exuberance. Accordingly, the listener may be moved to sing, beat time, or dance along the music. In contrast, music having a high stress level causes a listener's brain to secrete cortisol, which is commonly known as a hormone associated with stress. Accordingly, the listener may experience an unstable emotion, such as depression or anxiety.
  • Thayer defined the four processing modes using the correlations described above, wherein a first mode represents “exuberance”, a second mode “depression”, a third mode “anxious/frantic”, and a fourth mode “contentment”.
  • The first mode representing “exuberance” is a mode which is set to represent a pleasantly-energetic and joyful (i.e., happy) emotion. An audio signal in the “exuberance” mode is in a state of low stress and high energy. Therefore, the “exuberance” mode can be noisy and vigorous due to dynamic and high energy qualities of the audio signal, but can still exude a low level of stress on a listener due to a low level of stress qualities of the audio signal. An example of music providing the emotion of the first mode is Rossini's “William Tell Overture.”
  • The second mode representing “depression” is a mode which is set to represent a calm and somewhat sad emotion. An audio signal in the “depression” mode is in a state of high stress and low energy. An example of music providing the emotion of the second mode is Stravinsky's “Firebird.”
  • The third mode representing “anxious/frantic” is a mode which is set to represent an eruptive and frenzied emotion (e.g., restlessness). An audio signal in the “anxious/frantic” mode is in a state of high stress and high energy. An example of music providing the emotion of the third mode is Berg's “Lulu.” In addition, general “rock,” heavy-metal,” and “punk” music correspond to the third mode.
  • The fourth mode representing “contentment” is a mode which is set to represent a very joyful, serene, and positive emotion. An audio signal in the “contentment” mode is in a state of low stress and low energy. An example of music providing the emotion of the fourth mode is Bach's “Jesus, Joy of Man's Desire.”
  • Each of the four processing modes can be defined by the physical properties mentioned above, such as a tone (a major or a minor), timbre, sound intensity, and rhythm of an audio signal. In this embodiment, tone will be excluded due to difficulty in tone identification. Therefore, in the present embodiment, each of the four modes is defined using the three properties mentioned above, excluding the tone, and processing is performed according to a selected mode.
  • The timbre and the rhythm of the audio signal are related to the stress level, and the sound intensity of the audio signal is related to the energy level. The sound intensity uses Root Mean Square (RMS) power to form the audio signal, where high sound intensity makes the sound heard high, and low sound intensity makes the sound heard low. The timbre indicates whether a sound produced by the audio signal is bright or heavy, or whether a sound pitch is high or low. The rhythm indicates whether a tempo of the audio signal is quick or slow, or whether strength of a sound produced by the audio signal is strong or weak.
  • The processing unit 120 of FIG. 2 can modify, for example, the timbre, the sound intensity, and the rhythm of the audio signal based on the selected processing mode. The audio signal may be output in real-time through a radio or other media player. In addition, the audio signal can be in a stored file, such as an mp3 file, to be output at a later time by a media player.
  • An operation of the processing unit 120 will now be described in detail with reference to FIGS. 3 through 11.
  • FIG. 3 is a table illustrating physical properties of processing modes and processing methods thereof according to an embodiment of the present general inventive concept.
  • Referring to FIG. 3, a first processing mode is a mode representing “exuberance,” and is set to process an audio signal to have intermediate sound intensity, high-pitch timbre, and a quick-tempo rhythm.
  • The processing unit 120 of FIG. 2 has two processing methods to implement the physical properties set as the first processing mode, wherein one is a processing method in a time domain, and the other is a processing method in a frequency domain.
  • In the time domain, the processing unit 120 increases a sampling rate of the audio signal. The sampling rate is a rate of converting a sound to samples corresponding to 1 second. For example, a sampling rate of 44.1 KHz means that a sound is sampled 44100 times for 1 second. An increase of the sampling rate means that more samples are obtained for 1 second. Why the sampling rate is increased is because an increase of the sampling rate results in high-pitch timbre and a quick tempo.
  • However, the method of increasing the sampling rate cannot be applied to an audio signal which is output in real-time, but can be applied to a stored audio signal, such as an mp3 file.
  • In the frequency domain, the processing unit 120 moves a position of the audio signal to a higher frequency band and amplifies audio signal energy corresponding to a frequency band between 2 KHz and 3 KHz.
  • FIG. 4 is a diagram illustrating an audio signal whose position is moved to a higher frequency band according to an embodiment of the present general inventive concept.
  • In FIG. 4, an original audio signal 410 assumed as a sine wave and an audio signal 420, which is obtained by moving a position of the original audio signal 410, are illustrated. The audio signal 420 can be obtained by multiplying a frequency of the original audio signal 410 by 2.
  • Accordingly, a signal located at 20 Hz is moved to 40 Hz, and a signal located at 1 KHz is moved to 2 KHz. That is, if an audio signal is moved to a higher frequency band, the audio signal has high-pitch timbre.
  • However, the method of moving a frequency band is not limited thereto. That is, all methods which can move a frequency band can be used.
  • FIG. 5 is a diagram illustrating a characteristic of a filter which is used by the processing unit 120 of FIG. 2 in the first processing mode of FIG. 3, according to an embodiment of the present general inventive concept.
  • FIG. 5 illustrates a graph in which a function value gradually decreases based on 2 KHz. If an audio signal is filtered using the filter in the first processing mode, as illustrated in FIG. 5, energy of the audio signal in the frequency band between 2 KHz and 3 KHz can be increased.
  • Referring back to FIG. 3, a second processing mode is a mode representing “depression,” and is set to process an audio signal to have low sound intensity, low-pitch timbre, and a slow-tempo rhythm.
  • Like in the first processing mode, the processing unit 120 has two processing methods to implement the physical properties in the second processing mode, wherein one is a processing method in the time domain, and the other is a processing method in the frequency domain.
  • In the time domain, the processing unit 120 decreases a sampling rate of the audio signal. The sampling rate is decreased because a decrease of the sampling rate results in low-pitch timbre and a slow tempo.
  • However, the method of decreasing the sampling rate cannot be applied to an audio signal which is output in real-time, but can be applied to a stored audio signal, such as an mp3 file.
  • In the frequency domain, the processing unit 120 moves a position of the audio signal to a lower frequency band and reduces audio signal energy corresponding to a frequency band between 2 KHz and 20 KHz. The movement and the reduction operations of the audio signal may be performed in any order.
  • FIG. 6 is a diagram illustrating an audio signal whose position is moved to a lower frequency band according to an embodiment of the present general inventive concept.
  • In FIG. 6, an original audio signal 610 which is represented by a sine wave and an audio signal 620 which is obtained by moving a position of the original audio signal 610 are illustrated. The audio signal 620 can be obtained by multiplying a frequency of the original audio signal 610 by 0.5.
  • Accordingly, a signal which located at 20 Hz is moved to 10 Hz, and a signal located at 1 KHz is moved to 500 Hz. That is, if an audio signal is moved to a lower frequency band, the audio signal has low-pitch timbre.
  • FIG. 7 is a diagram illustrating a characteristic of a filter used by the processing unit 120 of FIG. 2 in the second processing mode of FIG. 3, according to an embodiment of the present general inventive concept.
  • FIG. 7 illustrates a graph in which a function value in the frequency band between 2 KHz and 20 KHz gradually decreases according to an increase of a frequency. If an audio signal is filtered using the filter in the second processing mode, as illustrated in FIG. 7, energy of the audio signal in the frequency band between 2 KHz and 20 KHz can be decreased.
  • Referring back to FIG. 3, a third processing mode is a mode representing an “anxious/frantic” emotion, and is set to process an audio signal to have high sound intensity and a strong rhythm.
  • In order to implement the physical properties set as the third processing mode, the processing unit 120 widens a dynamic range of the audio signal and amplifies audio signal energy corresponding to frequency bands around 80 Hz and 5 KHz. The widening and the amplification operations of the audio signal may be performed in any order.
  • FIG. 8 is a diagram illustrating an audio signal whose dynamic range is widened according to an embodiment of the present general inventive concept.
  • The dynamic range is a range between a function value in which energy of an audio signal is maximized and a function value in which the energy of the audio signal is minimized. In FIG. 8, although a dynamic range of an original audio signal 810 is between 40 dB and 80 dB, the processing unit 120 of FIG. 2 widens the dynamic range to a range between 20 dB and 100 dB as illustrated by a widened audio signal 820. If the dynamic range is widened, an entire energy level can be increased while maintaining musical quality.
  • FIG. 9 is a diagram illustrating a characteristic of a filter which is used by the processing unit 120 of FIG. 2 in the third processing mode of FIG. 3, according to an embodiment of the present general inventive concept.
  • FIG. 9 illustrates a graph in which magnitudes of function values in frequency bands around 80 Hz and 5 KHz are relatively greater. If an audio signal is filtered using the filter in the third processing mode, as illustrated in FIG. 9, energy of the audio signal in the frequency bands around 80 Hz and 5 KHz can be increased.
  • Referring back to FIG. 3, a fourth processing mode is a mode representing “contentment,” and is set to process an audio signal to have physical properties of low sound intensity, bright timbre, and a slow-tempo rhythm.
  • In order to implement the physical properties set as the fourth processing mode, the processing unit 120 narrows a dynamic range of the audio signal and) amplifies audio signal energy corresponding to a frequency band between 6 KHz and 20 KHz. The narrowing and the amplification operations of the audio signal may be performed in any order.
  • In addition, the processing unit 120 can implement a low-tempo rhythm in a method of reducing a time domain of the audio signal processed by the narrowing and amplification operations of the audio signal using a Time Domain Pitch Synchronous Overlap-Add (TD-PSOLA) method. However, the TD-PSOLA method cannot be applied to an audio signal which is output in real-time, but can be applied to a stored audio signal, such as an mp3 file.
  • FIG. 10 is a diagram illustrating a method of generating an audio signal having a slow-tempo rhythm in the fourth processing mode of FIG. 3, according to an embodiment of the present general inventive concept.
  • FIG. 10 is a diagram illustrating a schematic concept of the TD-PSOLA method and illustrates a method of decreasing tempo speed by processing an audio signal having a reproduction time of 3 minutes to have a reproduction time of 4 minutes.
  • Referring to FIG. 10, in the time domain, an entire duration of an audio signal is divided into 3 time durations A, B, and C (as denoted by reference number 1010). Two time durations D and E are added to the 3 time durations A, B, and C (as denoted by reference number 1020). The two additional time durations D and E can be added using a mean value of the audio signal or a value which is obtained by copying an audio signal before each of the additional time durations are added. An audio signal having a reproduction time of 4 minutes is generated by synthesizing the five time durations (as denoted by reference number 1030).
  • Since the TD-PSOLA method is well known to one of ordinary skill in the art, a detailed description thereof is omitted.
  • FIG. 11 is a diagram illustrating an audio signal whose dynamic range is narrowed according to an embodiment of the present general inventive concept.
  • In FIG. 11, although a dynamic range of an original audio signal 1110 is between 40 dB and 80 dB, the processing unit 120 reduces the dynamic range to a range between 50 dB and 70 dB illustrated by a narrowed audio signal 1120. If a dynamic range is narrowed, an entire energy level can be decreased due to a decrease of an alteration range of the audio signal.
  • FIG. 12 is a diagram illustrating a characteristic of a filter used by the processing unit 120 of FIG. 2 in the fourth processing mode of FIG. 3, according to an embodiment of the present general inventive concept.
  • FIG. 12 illustrates a graph in which a function value in a frequency band above 6 KHz gradually increases according to an increase of a frequency. If an audio signal is filtered using the filter in the fourth processing mode, as illustrated in FIG. 12, energy of the audio signal in the frequency band above 6 KHz can be increased, resulting in bright timbre of the audio signal.
  • The present general inventive concept can also be embodied as computer-readable codes on a computer-readable medium. The computer-readable medium can include a computer-readable recording medium and a computer-readable transmission medium. The computer-readable recording medium is any data storage device that can store data which can be thereafter read by a computer system. Examples of the computer-readable recording medium include read-only memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks, and optical data storage devices. The computer-readable recording medium can also be distributed over network coupled computer systems so that the computer-readable code is stored and executed in a distributed fashion. The computer-readable transmission medium can transmit carrier waves or signals (e.g., wired or wireless data transmission through the Internet). Also, functional programs, codes, and code segments to accomplish the present general inventive concept can be easily construed by programmers skilled in the art to which the present general inventive concept pertains.
  • As described above, according to the present general inventive concept, by modifying timbre, sound intensity, and rhythm of an audio signal, based on audio signal processing modes which are classified based on the timbre, the sound intensity, and the rhythm of the audio signal, the audio signal can be adjusted based on music classification according to moods.
  • Although a few embodiments of the present general inventive concept have been shown and described, it will be appreciated by those skilled in the art that changes may be made in these embodiments without departing from the principles and spirit of the general inventive concept, the scope of which is defined in the appended claims and their equivalents.

Claims (41)

1. A method of post-processing an audio signal, the method comprising:
selecting one of a plurality of audio signal processing modes classified based on at least one of timbre, sound intensity, and rhythm of the audio signal; and
modifying at least one of the timbre, the sound intensity, and the rhythm of the audio signal based on the selected audio signal processing mode.
2. The method of claim 1, wherein the modifying comprises increasing or decreasing a sampling rate of the audio signal.
3. The method of claim 1, wherein the modifying comprises:
converting the audio signal to an audio signal of a frequency domain;
moving a position of the audio signal of the frequency domain to a higher frequency band by a predetermined frequency; and
amplifying audio signal energy contained in a predetermined frequency band of the moved audio signal.
4. The method of claim 3, wherein the amplifying of the audio signal energy comprises amplifying audio signal energy contained in a frequency band between 2 KHz and 3 KHz.
5. The method of claim 1, wherein the modifying comprises:
converting the audio signal to an audio signal of a frequency domain;
moving a position of the audio signal of the frequency domain to a lower frequency band by a predetermined frequency; and
reducing audio signal energy contained in a predetermined frequency band of the moved audio signal.
6. The method of claim 5, wherein the reducing of the audio signal energy comprises reducing audio signal energy contained in a frequency band between 2 KHz and 20 KHz.
7. The method of claim 1, wherein the modifying comprises:
converting the audio signal to an audio signal of a frequency domain;
widening a dynamic range of the audio signal of the frequency domain; and
amplifying audio signal energy contained in a predetermined frequency band of the audio signal whose dynamic range is widened.
8. The method of claim 7, wherein the amplifying of the audio signal energy comprises amplifying audio signal energy contained in a predetermined frequency band based on 80 Hz and a predetermined frequency band based on 5 KHz.
9. The method of claim 1, wherein the modifying comprises:
converting the audio signal to an audio signal of a frequency domain;
narrowing a dynamic range of the audio signal of the frequency domain; and
amplifying audio signal energy contained in a predetermined frequency band of the audio signal whose dynamic range is narrowed.
10. The method of claim 9, wherein the amplifying of the audio signal energy comprises amplifying audio signal energy contained in a frequency band between 6 KHz and 20 KHz.
11. An apparatus to post-process an audio signal, the apparatus comprising:
a mode selector to select one of a plurality of audio signal processing modes classified based on at least one of timbre, sound intensity, and rhythm of the audio signal; and
a processing unit to modify at least one of the timbre, the sound intensity, and the rhythm of the audio signal based on the selected audio signal processing mode.
12. The apparatus of claim 11, wherein the processing unit increases or decreases a sampling rate of the audio signal.
13. The apparatus of claim 11, wherein the processing unit converts the audio signal to an audio signal of a frequency domain, moves a position of the audio signal of the frequency domain to a higher frequency band by a predetermined frequency, and amplifies audio signal energy contained in a predetermined frequency band of the moved audio signal.
14. The apparatus of claim 13, wherein the processing unit amplifies audio signal energy contained in a frequency band between 2 KHz and 3 KHz.
15. The apparatus of claim 11, wherein the processing unit converts the audio signal to an audio signal of a frequency domain, moves a position of the audio signal of the frequency domain to a lower frequency band by a predetermined frequency, and reduces audio signal energy contained in a predetermined frequency band of the moved audio signal.
16. The apparatus of claim 15, wherein the processing unit reduces audio signal energy contained in a frequency band between 2 KHz and 20 KHz.
17. The apparatus of claim 11, wherein the processing unit converts the audio signal to an audio signal of a frequency domain, widens a dynamic range of the audio signal of the frequency domain, and amplifies audio signal energy contained in a predetermined frequency band of the audio signal whose dynamic range is widened.
18. The apparatus of claim 17, wherein the processing unit amplifies audio signal energy contained in a predetermined frequency band based on 80 Hz and a predetermined frequency band based on 5 KHz.
19. The apparatus of claim 11, wherein the processing unit converts the audio signal to an audio signal of a frequency domain, narrows a dynamic range of the audio signal of the frequency domain, and amplifies audio signal energy contained in a predetermined frequency band of the audio signal whose dynamic range is narrowed.
20. The apparatus of claim 19, wherein the processing unit amplifies audio signal energy contained in a frequency band between 6 KHz and 20 KHz.
21. A computer readable recording medium having embodied thereon a computer program to execute a method, wherein the method comprises:
selecting one of a plurality of audio signal processing modes classified based on at least one of timbre, sound intensity, and rhythm of the audio signal; and
modifying at least one of the timbre, the sound intensity, and the rhythm of the audio signal based on the selected audio signal processing mode.
22. An apparatus to post-process an audio signal, the apparatus comprising:
a mode selector to select one of a plurality of audio signal processing modes based on a plurality of different human emotions; and
a processing unit to modify at least one of a frequency, a dynamic range, and a sampling rate of the audio signal based on the selected audio signal processing mode.
23. The apparatus of claim 22, wherein the plurality of different human emotions comprise at least one of exuberance, depression, anxiety, and contentment.
24. The apparatus of claim 23, wherein the processing unit moves a position of the audio signal to a higher frequency band and amplifies the audio signal when the audio signal processing mode corresponding to the human emotion of exuberance is selected.
25. The apparatus of claim 24, wherein the processing unit amplifies audio signal energy contained in a frequency band between 2 KHz and 3 KHz.
26. The apparatus of claim 23, wherein the processing unit moves a position of the audio signal to a lower frequency band and reduces the audio signal when the audio signal processing mode corresponding to the human emotion of depression is selected.
27. The apparatus of claim 26, wherein the processing unit reduces audio signal energy contained in a frequency band between 2 KHz and 20 KHz.
28. The apparatus of claim 23, wherein the processing unit widens a dynamic range of the audio signal and amplifies the audio signal when the audio signal processing mode corresponding to the human emotion of anxiety is selected.
29. The apparatus of claim 28, wherein the processing unit amplifies audio signal energy contained in bands around 80 Hz and 5 KHz.
30. The apparatus of claim 23, wherein the processing unit narrows a dynamic range of the audio signal and amplifies the audio signal when the audio signal processing mode corresponding to the human emotion of contentment is selected.
31. The apparatus of claim 30, wherein the processing unit amplifies audio signal energy contained in a frequency band between 6 KHz and 20 KHz.
32. The apparatus of claim 22, wherein the plurality of human emotions comprise at least one of happiness, sadness, restlessness, and serenity.
33. The apparatus of claim 22, wherein the audio signal processing modes are based on the Thayer mood model.
34. An apparatus to post-process an audio signal, the apparatus comprising:
a mode selector to select one of a plurality of audio signal processing modes based on at least one of timbre, sound intensity, and rhythm of the audio signal; and
a processing unit to adjust at least one of a frequency, a dynamic range, and a sampling rate of the audio signal based on the selected audio signal processing mode.
35. The apparatus of claim 34, wherein the processing unit moves a position of the audio signal to a higher frequency band and amplifies the audio signal in a first mode.
36. The apparatus of claim 34, wherein the processing unit moves a position of the audio signal to a lower frequency band and reduces the audio signal in a second mode.
37. The apparatus of claim 34, wherein the processing unit widens a dynamic range of the audio signal and amplifies the audio signal in a third mode.
38. The apparatus of claim 34, wherein the processing unit narrows a dynamic range of the audio signal and amplifies the audio signal in a fourth mode.
39. A method of post-processing an audio signal, the method comprising:
selecting one of a plurality of audio signal processing modes based on a plurality of different human emotions; and
modifying at least one of a frequency, a dynamic range, and a sampling rate of the audio signal based on the selected audio signal processing mode.
40. The method of claim 39, wherein the plurality of different human emotions comprise at least one of exuberance, depression, anxiety, and contentment.
41. A method of post-processing an audio signal, the method comprising:
selecting one of a plurality of audio signal processing modes based on at least one of timbre, sound intensity, and rhythm of the audio signal; and
adjusting at least one of a frequency, a dynamic range, and a sampling rate of the audio signal based on the selected audio signal processing mode.
US11/755,086 2006-12-27 2007-05-30 Method and apparatus to post-process an audio signal Abandoned US20080160943A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR2006-134984 2006-12-27
KR1020060134984A KR20080060641A (en) 2006-12-27 2006-12-27 Post-processing method of audio signal and apparatus therefor

Publications (1)

Publication Number Publication Date
US20080160943A1 true US20080160943A1 (en) 2008-07-03

Family

ID=39584703

Family Applications (1)

Application Number Title Priority Date Filing Date
US11/755,086 Abandoned US20080160943A1 (en) 2006-12-27 2007-05-30 Method and apparatus to post-process an audio signal

Country Status (3)

Country Link
US (1) US20080160943A1 (en)
KR (1) KR20080060641A (en)
CN (1) CN101211557A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011095913A1 (en) * 2010-02-02 2011-08-11 Koninklijke Philips Electronics N.V. Spatial sound reproduction
DK201300471A1 (en) * 2013-08-20 2015-03-02 Bang & Olufsen As System for dynamically modifying car audio system tuning parameters
US20160005415A1 (en) * 2014-07-04 2016-01-07 Arc Co., Ltd. Audio signal processing apparatus and audio signal processing method thereof
WO2017036559A1 (en) * 2015-09-04 2017-03-09 Arcelik Anonim Sirketi Image display device with automatic volume curve pairing function
US10142758B2 (en) 2013-08-20 2018-11-27 Harman Becker Automotive Systems Manufacturing Kft System for and a method of generating sound

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101615262B1 (en) 2009-08-12 2016-04-26 삼성전자주식회사 Method and apparatus for encoding and decoding multi-channel audio signal using semantic information
CN102547524A (en) * 2011-12-15 2012-07-04 无锡中星微电子有限公司 Method and device for processing audio frequency data in Bluetooth earphone and equipment
CN104080024B (en) 2013-03-26 2019-02-19 杜比实验室特许公司 Volume leveller controller and control method and audio classifiers
CN106155470B (en) * 2015-04-21 2019-09-17 阿里巴巴集团控股有限公司 A kind of audio file generation method and device
CN106599057A (en) * 2016-11-18 2017-04-26 上海斐讯数据通信技术有限公司 Music rhythm control terminal and method adaptive to user emotion
CN107818792A (en) * 2017-10-25 2018-03-20 北京奇虎科技有限公司 Audio conversion method and device
CN108600936B (en) * 2018-04-19 2020-01-03 北京微播视界科技有限公司 Multi-channel audio processing method, device, computer-readable storage medium and terminal
CN112863466B (en) * 2021-01-07 2024-05-31 广州欢城文化传媒有限公司 Audio social interaction method and device

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3979989A (en) * 1974-05-31 1976-09-14 Nippon Gakki Seizo Kabushiki Kaisha Electronic musical instrument
US4991218A (en) * 1988-01-07 1991-02-05 Yield Securities, Inc. Digital signal processor for providing timbral change in arbitrary audio and dynamically controlled stored digital audio signals
US5998724A (en) * 1997-10-22 1999-12-07 Yamaha Corporation Tone synthesizing device and method capable of individually imparting effect to each tone to be generated
US6570078B2 (en) * 1998-05-15 2003-05-27 Lester Frank Ludwig Tactile, visual, and array controllers for real-time control of music signal processing, mixing, video, and lighting
US20030131715A1 (en) * 2002-01-04 2003-07-17 Alain Georges Systems and methods for creating, modifying, interacting with and playing musical compositions
US20030159567A1 (en) * 2002-10-18 2003-08-28 Morton Subotnick Interactive music playback system utilizing gestures
US6696633B2 (en) * 2001-12-27 2004-02-24 Yamaha Corporation Electronic tone generating apparatus and signal-processing-characteristic adjusting method
US20040089141A1 (en) * 2002-11-12 2004-05-13 Alain Georges Systems and methods for creating, modifying, interacting with and playing musical compositions
US20050211071A1 (en) * 2004-03-25 2005-09-29 Microsoft Corporation Automatic music mood detection

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3979989A (en) * 1974-05-31 1976-09-14 Nippon Gakki Seizo Kabushiki Kaisha Electronic musical instrument
US4991218A (en) * 1988-01-07 1991-02-05 Yield Securities, Inc. Digital signal processor for providing timbral change in arbitrary audio and dynamically controlled stored digital audio signals
US5998724A (en) * 1997-10-22 1999-12-07 Yamaha Corporation Tone synthesizing device and method capable of individually imparting effect to each tone to be generated
US6570078B2 (en) * 1998-05-15 2003-05-27 Lester Frank Ludwig Tactile, visual, and array controllers for real-time control of music signal processing, mixing, video, and lighting
US20040099129A1 (en) * 1998-05-15 2004-05-27 Ludwig Lester F. Envelope-controlled time and pitch modification
US6696633B2 (en) * 2001-12-27 2004-02-24 Yamaha Corporation Electronic tone generating apparatus and signal-processing-characteristic adjusting method
US20030131715A1 (en) * 2002-01-04 2003-07-17 Alain Georges Systems and methods for creating, modifying, interacting with and playing musical compositions
US20030159567A1 (en) * 2002-10-18 2003-08-28 Morton Subotnick Interactive music playback system utilizing gestures
US20040089141A1 (en) * 2002-11-12 2004-05-13 Alain Georges Systems and methods for creating, modifying, interacting with and playing musical compositions
US20050211071A1 (en) * 2004-03-25 2005-09-29 Microsoft Corporation Automatic music mood detection

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011095913A1 (en) * 2010-02-02 2011-08-11 Koninklijke Philips Electronics N.V. Spatial sound reproduction
US9282417B2 (en) 2010-02-02 2016-03-08 Koninklijke N.V. Spatial sound reproduction
DK201300471A1 (en) * 2013-08-20 2015-03-02 Bang & Olufsen As System for dynamically modifying car audio system tuning parameters
US10142758B2 (en) 2013-08-20 2018-11-27 Harman Becker Automotive Systems Manufacturing Kft System for and a method of generating sound
US20160005415A1 (en) * 2014-07-04 2016-01-07 Arc Co., Ltd. Audio signal processing apparatus and audio signal processing method thereof
WO2017036559A1 (en) * 2015-09-04 2017-03-09 Arcelik Anonim Sirketi Image display device with automatic volume curve pairing function

Also Published As

Publication number Publication date
KR20080060641A (en) 2008-07-02
CN101211557A (en) 2008-07-02

Similar Documents

Publication Publication Date Title
US20080160943A1 (en) Method and apparatus to post-process an audio signal
US9239700B2 (en) System and method for automatically producing haptic events from a digital audio signal
JP6178456B2 (en) System and method for automatically generating haptic events from digital audio signals
JP4640463B2 (en) Playback apparatus, display method, and display program
US8761915B2 (en) System and method for automatically producing haptic events from a digital audio file
JP6480526B2 (en) Applicable music playback system
CN110211556B (en) Music file processing method, device, terminal and storage medium
DE102012103553A1 (en) AUDIO SYSTEM AND METHOD FOR USING ADAPTIVE INTELLIGENCE TO DISTINCT THE INFORMATION CONTENT OF AUDIOSIGNALS IN CONSUMER AUDIO AND TO CONTROL A SIGNAL PROCESSING FUNCTION
KR101534346B1 (en) Music playback apparatus, music playback method, and recording medium
CN101120412A (en) A system for and a method of mixing first audio data with second audio data, a program element and a computer-readable medium
US20240314499A1 (en) Techniques for audio track analysis to support audio personalization
JP2007534995A (en) Method and system for classifying audio signals
JP4330174B2 (en) Information selection method, information selection device, etc.
JP2006202127A (en) Recommended information presentation device and recommended information presentation method or the like
WO2017135350A1 (en) Recording medium, acoustic processing device, and acoustic processing method
Werthen-Brabants Ground truth extraction & transition analysis of DJ mixes
US9905208B1 (en) System and method for automatically forming a master digital audio track
JP4933330B2 (en) Music playback control device and playlist creation method
CN112185325B (en) Audio playback style adjustment method, device, electronic device and storage medium
Teyssier-Ramírez Smart Audio Equalizer
JP2006023524A (en) Analysis system and reproduction apparatus for acoustic signal
JP2008250113A (en) Music analysis method and audio apparatus

Legal Events

Date Code Title Description
AS Assignment

Owner name: SAMSUNG ELECTRONICS CO., LTD., KOREA, REPUBLIC OF

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:SUNG, HO-YOUNG;LEE, JAE-CHEOL;REEL/FRAME:019352/0988

Effective date: 20070528

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION

OSZAR »