Prototyping IV: Image Extender – Image sonification tool for immersive perception of sounds from images and new creation possibilities

Tests on automated audio file search via freesound.org api:

For further use in the automated audio file search of the recognized objects I tested the freesound.org api and programmed the first interface for testing purposes. The first thing I had to do was request an API-Key by freesound.org. After that I noticed an interesting point to think about using it in my project: it is open for 5000 requests per year, but I will research on possibilities for using it more. For the testing 5000 is more than enough.

The current code already searches with a few testing tags and gives possibilities to filter the searches by samplerate, duration, licence and file type. There might be added more filter possibilities next like rating, bit depth, and maybe the possibility of random file selection so it won’t be always the same for each tag.

Next steps would also include to either download the file or just play it automatically. Then there will be tests on using the tags of the AI image recognition code for this automated search. And later in the process I have to figure out the playback of multiple files, volume staging and filtering or EQing methods for masking effects etc…

Test gui for automated sound searching via freesounds.org API

David Adlberger is a sound designer and media artist based in Graz. With a technical background and a Bachelor’s degree in Media Technology from FH St. Pölten, he is currently pursuing a Master’s degree in Sound Design at FH Joanneum and Kunstuniversität Graz. His work explores the intersection of narrative, technology, and perception. Fascinated since childhood by the creation of sonic worlds, he combines technical and artistic experimentation. His practice ranges from film sound and immersive 3D audio to algorithmic composition and audiovisual installations.
Leave a Reply

Your email address will not be published. Required fields are marked *