Acoustic fingerprint: Difference between revisions
Facharbeit (talk | contribs) No edit summary |
|||
Line 51: | Line 51: | ||
== External links == |
== External links == |
||
* [http://www.luminescence-software.org |
* [http://www.luminescence-software.org Metatogger is a freeware that uses the AcoustID service for identifying audio files.] |
||
* [http://www.audiblemagic.com Audible Magic (audio & video image fingerprinting) ] |
* [http://www.audiblemagic.com Audible Magic (audio & video image fingerprinting) ] |
||
* [http://www.auditude.com Auditude Connect technology (audio and video fingerprinting) ] |
* [http://www.auditude.com Auditude Connect technology (audio and video fingerprinting) ] |
Revision as of 15:16, 6 August 2013
This article needs additional citations for verification. (June 2011) |
An acoustic fingerprint is a condensed digital summary, deterministically generated from an audio signal, that can be used to identify an audio sample or quickly locate similar items in an audio database.[1]
Practical uses of acoustic fingerprinting include identifying songs, melodies, tunes, or advertisements; sound effect library management; and video file identification. Media identification using acoustic fingerprints can be used to monitor the use of specific musical works and performances on radio broadcast, records, CDs and peer-to-peer networks. This identification has been used in copyright compliance, licensing, and other monetization schemes.
Attributes
A robust acoustic fingerprint algorithm must take into account the perceptual characteristics of the audio. If two files sound alike to the human ear, their acoustic fingerprints should match, even if their binary representations are quite different. Acoustic fingerprints are not bitwise fingerprints, which must be sensitive to any small changes in the data. Acoustic fingerprints are more analogous to human fingerprints where small variations that are insignificant to the features the fingerprint uses are tolerated. One can imagine the case of a smeared human fingerprint impression which can accurately be matched to another fingerprint sample in a reference database; acoustic fingerprints work in a similar way.
Perceptual characteristics often exploited by audio fingerprints include average zero crossing rate, estimated tempo, average spectrum, spectral flatness, prominent tones across a set of bands, and bandwidth.
Most audio compression techniques (AAC, MP3, WMA, Vorbis) will make radical changes to the binary encoding of an audio file, without radically affecting the way it is perceived by the human ear. A robust acoustic fingerprint will allow a recording to be identified after it has gone through such compression, even if the audio quality has been reduced significantly. For use in radio broadcast monitoring, acoustic fingerprints should also be insensitive to analog transmission artifacts.
On the other hand, a good acoustic fingerprint algorithm must be able to identify a particular master recording among all the productions of an artist or group. For use as evidence in a court of law, an acoustic fingerprint method must be forensic in its accuracy.[citation needed]
Implementations
This is a list of notable acoustic fingerprinting products.
- Proprietary
- All Media Guide's LASSO is a commercial service that uses acoustic fingerprinting, and other techniques, to recognize music. (U.S. patent 7,277,766)
- Audible Magic Corporation is a commercial venture that provides electronic media identification and copyright management solutions using proprietary acoustic fingerprinting technology U.S. patent 5,918,223 based on original research by Muscle Fish Consulting[2]
- AudioID is a commercial technology for automatically identifying audio material using acoustic fingerprints. It was developed by the German Fraunhofer Institute.
- BMAT Vericast[3] is a global music identification service that monitors millions of songs over 2000 radios and televisions across more than 50 countries worldwide. The solution provides real time recognition and auditable reporting based on an audio fingerprint that is resistant to signal alterations such as voice over, broadcast mastering or noisy channel degradation.
- YouTube's Content ID is able to identify audio/visual part of copyrighted content. In case if whole third party content is matched or only certain portion, user can check whether the content is properly identified or misidentified.
- Gracenote's MusicID is a commercial product that uses acoustic fingerprinting along with other methods to identify music.
- The Nero Multimedia Suite Nero (software suite) version 9 and 10 uses Gracenote to add metadata like author, title and genre to an audio file.
- Sony Ericsson's TrackID software uses Gracenote to identify songs being recorded via cell phone in a way similar to Shazam.
- Winamp version 5.5 uses Gracenote to power automatic playlist generation with "Nullsoft Playlist Generator" plugin that comes with the software.
- Midomi is a commercial service that can match music clips, as well as identifying a song that is sung or hummed
- Moodagent is a commercial service from Syntonetic that combines digital signal processing and AI techniques to create music profiles that incorporate characteristics such as mood, emotion, genre, style, instrument, vocals, orchestration, production, and beat/tempo.
- SoundHound, an acoustic fingerprint-based service with web and mobile platforms (Android, iOS, Windows Phone) that allows songs or hummed tunes to be identified using the Midomi service.
- Shazam, an acoustic fingerprint-based service allows for songs to be identified via cell phone.
- Tunatic by Wildbits is an application that allows identifying music while being played, analyzing the songs and comparing with the information on a server
- Open source
- MusicBrainz, a free and open content project for a music database that uses AcoustID's free database of audio fingerprints, which aims to map its fingerprints to the MusicBrainz database. MusicBrainz also used MusicIP's Open Fingerprint Architecture[4] for fingerprinting and the AmpliFIND (formerly MusicDNS) service for identifying audio files since 2006,[5] but is phasing out AmpliFIND in favour of the open source AcoustID, after AmpliFIND was acquired by Gracenote in 2011.[6]
- Last.fm's own acoustic fingerprinting application[7] was released in 2007.[8] The technology is now included in the Last.fm client software.
- AcoustID is an open source project that aims to create a free database of audio fingerprints with mapping to the MusicBrainz metadata database and provide a web service for audio file identification using this database.
- Echoprint is an open source music fingerprint and resolving framework powered by the The Echo Nest.
See also
References
- ^ ISO IEC TR 21000-11 (2004), Multimedia framework (MPEG-21) -- Part 11: Evaluation Tools for Persistent Association Technologies
- ^ "Content-Based Classification, Search, and Retrieval of Audio," IEEE MultiMedia, vol. 3, no. 3, pp. 27-36, Sept., 1996.
- ^ http://www.bmat.com/products/vericast/
- ^ LibOFA (Library Open Fingerprint Architecture)
- ^ Robert Kaye (mayhem) (12 March 2006). "New fingerprinting technology available now!". MusicBrainz Blog.
- ^ Picard 0.16 released with AcoustID support
- ^ lastfm/Fingerprinter · GitHub
- ^ Last.fm blog
External links
- Metatogger is a freeware that uses the AcoustID service for identifying audio files.
- Audible Magic (audio & video image fingerprinting)
- Auditude Connect technology (audio and video fingerprinting)
- (audio and video fingerprinting)
- Civolution (content identification with audio and video fingerprinting)
- New Media Lab broadcast monitoring service using audio fingerprinting technology.
- Template:MusicBrainz wiki
- A Review of Algorithms for Audio Fingerprinting (P. Cano et al. In International Workshop on Multimedia Signal Processing, US Virgin Islands, December 2002)
- PlayKontrol media monitoring automated clip matching - has REAL-TIME audio matching DEMO
- Wang, Avery Li-Chun (2003). "An Industrial-Strength Audio Search Algorithm" (PDF). Shazam Entertainment. Retrieved 2012-09-04.