黑料网

News

Acoustics researchers decompose sound accurately into its three basic components

Any sound can now be perfectly replicated by a combination of whistles, clicks, and hisses, with implications for sound processing across the media landscape

Researchers have been looking for ways to decompose sound into its basic ingredients for over 200 years. In the 1820s, French scientist Joseph Fourier proposed that any signal, including sounds, can be built using sufficiently many sine waves. These waves sound like whistles, each have their own frequency, level and start time, and are the basic building blocks of sound.

However, some sounds, such as the flute and a breathy human voice, may require hundreds or even thousands of sines to exactly imitate the original waveform. This comes from the fact that such sounds contain a less harmonical, more noisy structure, where all frequencies occur at once. One solution is to divide sound into two types of components, sines and noise, with a smaller number of whistling sine waves and combined with variable noises, or hisses, to complete the imitation.

Even this 鈥榗omplete鈥 two-component sound model has issues with the smoothing of the beginnings of sound events, such as consonants in voice or drum sounds in music. A third component, named transient, was introduced around the year 2000 to help model the sharpness of such sounds. Transients alone sound like clicks. From then on, sound has been often divided into three components: sines, noise, and transients.

The three-component model of sines, noise and transients has now been refined by researchers at Aalto University Acoustics Lab, using ideas from auditory perception, fuzzy logic, and perfect reconstruction.

Decomposition mirrors the way we hear sounds

Doctoral researcher Leonardo Fierro and professor Vesa V盲lim盲ki realized the way that people hear the different components and separate whistles, clicks, and hisses is important. If a click gets spread in time, it starts to ring and sound noisier; by contrast, focusing on very brief sounds might cause some loss of tonality.

Leonardo Fierro (vas.) ja professori Vesa V盲lim盲ki kokeilivat Sitranoa Aalto-yliopiston akustiikan laboratorion Otala-kuunteluhuoneessa. Kuva: Anna Berg / Aalto-yliopisto.
Leonardo Fierro (left) and Professor Vesa V盲lim盲ki demonstrated the enchanted decomposition method in the Otala listening room of Aalto Acoustics Lab. Photo: Anna Berg / Aalto University.

This insight from auditory perception was coupled with fuzzy logic: at any moment, part of the sound can belong to each of the three classes of sines, transients or noise, not just one of them. With the goal of perfect reconstruction, Fierro optimized the way sound is decomposed. 

In the enhanced method, sines and transients are two opposite characteristics of sound, and the sound is not allowed to belong to both classes at the same time. However, any of two opposite component types can still occur simultaneously with noise. Thus, the idea of fuzzy logic is present in a restricted way. The noise works as a fuzzy link between the sines and transients, describing all the nuances of the sound that are not captured by simple clicks and whistles. 鈥業t鈥檚 like finding the missing piece of a puzzle to connect those two parts that did not fit together before,鈥 says Fierro. 

This enhanced decomposition method was compared with previous methods in a listening test. Eleven experienced listeners were individually asked to audit several short music excepts and the components extracted from them using different methods. 

The new method emerged as the winning way to decompose most sounds, based on the listeners鈥 ratings. Only when there is a strong vibrato in a musical sound, such as in a singing voice or the violin, all decomposition methods struggle, and in these cases some previous methods are superior.

A test use case for the new decomposition method is the time-scale modification of sound, especially slowing down of music. This was tested in a preference listening test against the lab鈥檚 own previous method, which was selected as the best academic technique in a comparative study a few years ago. Again, Fierro鈥檚 new method was a clear winner.

鈥楾he new sound decomposition method opens many exciting possibilities in sound processing,鈥 says professor V盲lim盲ki. 鈥楾he slowing down of sound is currently our main interest. It is striking that for example in sports news, the slow-motion videos are always silent. The reason is probably that the sound quality in current slow-down audio tools is not good enough. We have already started developing better time-scale modification methods, which use a deep neural network to help stretch some components.鈥

The high-quality sound decomposition also enables novel types of music remixing techniques. One of them leads to distortion-free dynamic range compression. Namely, the transient component often contains the loudest peaks in the sound waveform, so simply reducing the level of the transient component and mixing it back with the others can limit the peak-to-peak value of audio.

Reference:

Fierro, L. & V盲lim盲ki, V. (2023). Enhanced Fuzzy Decomposition of Sound Into Sines, Transients, and Noise. Journal of the Audio Engineering Society. doi: 

Contact:

Leonardo Fierro
Doctoral Researcher, Aalto University, Department of Information and Communications Engineering, Acoustics Lab
leonardo.fierro@aalto.fi 

Vesa V盲lim盲ki
Professor, Aalto University, Department of Information and Communications Engineering, Acoustics Lab
vesa.valimaki@aalto.fi 
phone + 358 50 569 1176

 

Aalto University Acoustic Lab

Aalto Acoustics Lab

The Aalto Acoustics Lab is a multidisciplinary research center focusing on audio processing and spatial sound technologies. The laboratory gathers professors and research teams from three different units: Department of Information and Communications Engineering, Department of Computer Science, and Department of Art and Media.

  • Updated:
  • Published:
Share
URL copied!

Read more news

A conference hall filled with attendees sitting at tables, watching a presentation on a large screen.
Campus, Research & Art Published:

Physics Days 2026 gathered Finnish physicists 黑料网

The 2026 edition of the annual conference featured talks on moir茅 matter, women in physics and paper cuts.
A speaker addresses a large audience in a dark auditorium. A large screen behind shows a vibrant image with the text 'Welcome'.
Awards and Recognition, Research & Art Published:

Annual review looked back on the past year

The annual review of the School of Arts, Design and Architecture provided a comprehensive overview of the past year. Members of the community were also awarded in the event.
A person wearing a dark jacket stands outside a multi-storey building with many windows.
Awards and Recognition, Research & Art Published:

Alum of the Year Anna Brotkin: 鈥淲e need modern stories about our era鈥

Screenwriter Anna Brotkin is the Alum of the Year 2026 of the School of Arts, Design and Architecture. She believes in the power of locality and the importance of hope in times of crisis.
A white cylindrical machine with 'Aalto University' logo in an industrial setting.
Press releases Published:

Aalto University unveils AaltoQ20 鈥 a state-of-the-art quantum computer for educating quantum talent of the future

AaltoQ20 is a unique quantum computer that researchers can also use to study quantum phenomena and develop new technology.